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Introdu ctio n to lo oS : : | : : : . : : 


This applies particularly to 
Chapters Bl and B2. 


The Latin word calculus 
means a stone. The link is in 
the use of stones for counting. 


The Summary of Block A, at the end of Chapter A3, described the 
difference between discrete models and variables, on the one hand, and 
continuous models and variables on the other. In much of Block B you 
used ideas from discrete mathematics, such as recurrence relations and 
sequences, to model phenomena that involve change. 


In Block C you will again be modelling change, but this time in cases 
where the change is continuous or can reasonably be modelled as being so. 
Block C addresses questions such as the following: How fast is something 
changing? Is it increasing or decreasing? Does it ever stop changing, 
permanently or temporarily? How does the change accumulate? How can 
the change be described by an equation? 


Chapter Cl concerns the rate at which variables change, whether the 
change is an increase or decrease and whether the change stops. A 
mathematical process called differentiation is introduced to help in 
analysing these matters, together with some illustrations of its practical 
applications. 


Chapter C2 takes a different view towards change. The process of 
integration is introduced, as the reverse process of differentiation. 
Integration permits an accumulation of continuous changes to be described, 
and one significant application is to finding areas beneath graphs. 


Integration also features importantly in Chapter C3. Here it is needed in 
order to solve a new type of equation, called a differential equation, which 
arises frequently in mathematical models of phenomena which change. 


The branch of mathematics which includes the study of differentiation and 
integration is called calculus, so Block C provides an introduction to 
calculus. The history of this branch of mathematics goes back to the 
second half of the seventeenth century, when Sir Isaac Newton (1642-1727) 
in England and Gottfried Wilhelm Leibniz (1646-1716) in what is now 
Germany both developed independently the basic ideas of calculus. The 
word ‘calculus’ here means ‘a systematic method for solving a certain type 
of problem’, and what Newton and Leibniz did was not so much to 
discover the basic ideas as to bring them together in a systematic way and 
then show how their systematic methods could be used to solve problems. 


Unfortunately, a bitter argument grew up over who was the first to 
‘discover’ calculus. This split the mathematicians of the day (and for 
decades afterwards) into two camps and hindered the development of the 
subject. In fact, one legacy of the dispute is still with us: there are several 
notations in current use for expressing exactly the same thing in calculus. 
One set of notations can be traced back to Newton’s work and another to 
that of Leibniz. Other notations arose from more recent attempts to ‘tidy 
up’ the subject, following on from the greater understanding of calculus 
and its associated ideas that has developed since the time of Newton and 
Leibniz. You need to be able to recognise, understand and occasionally use 
all of these notations, even though you will probably settle eventually for 
using mainly one of the possible alternatives. 


Many branches of mathematics, science and technology have made use of 
calculus, which indicates its central importance and usefulness. ‘The 
chapters of this block refer to a range of contexts in order to illustrate the 
applications of calculus. 


There are five sections in this chapter. They are 
intended to be studied consecutively in five study 
sessions. Each section requires two to three hours. 


The pattern of study for each session might be as 
follows. 


Study session 1: Section 1. 
Study session 2: Section 2. 
Study session 3: Section 3. 
Study session 4: Section 4. 
Study session 5: Section 5. 


Section 2 is the longest and Section 3, which 
contains some material that can be omitted if you 
are short of time, is the shortest. 


Subsection 1.2 requires the use of an audio cassette 
player, and Subsection 3.3 requires the use of a 
video cassette player. Section 5 requires the use of 
the computer and Computer Book C. 


The optional Video Band C(i), Algebra 


workout — Differentiation, could be viewed at 
any stage during your study of this chapter. 


The use of a function to 
represent the relationship 
between a dependent variable 
and a single independent 
variable was introduced in 
Chapter A3, Subsection 1.1. 


Polynomial functions were 
introduced in Chapter A3, 
Section 5. 


This chapter introduces an important topic in calculus called 
differentiation. If a dependent variable y is related to a particular 
independent variable x, which can vary continuously, then differentiation 
enables us to find the rate at which y changes as x changes. Put another 
way, this rate is the gradient of the graph defined by the equation y = f(z) 
that relates the two variables. The expression for the rate of change in 
terms of x is called the derivative of f(x). 


Suppose, for example, that the motion of a car along a straight road is 
described by expressing its position relative to a fixed starting point in 
terms of the elapsed time. Then differentiation gives the rate of change of 
the car’s position with time, which indicates just how fast the car travels 
during its journey. This rate of change, known as the velocity of the car, 
may itself vary with time. Differentiation permits the ‘velocity function’ of 
the car to be obtained directly from its ‘position function’, where the 
independent variable is time in each case. 


Section 1 defines the process of differentiation, illustrating its application 
first to quadratic functions and then to power functions. In Section 2, 
differentiation is applied to polynomial functions, following which you will 
see how differentiation can be employed to find the maximum or minimum 
value attained by a function. The process of finding these values is called 
optimisation, and is of use both for modelling purposes and for permitting 
the graphs of functions to be sketched quickly. While all the examples 
considered here involve polynomial functions, the principles apply also to 
many other types of function. 


In Section 3, differentiation is applied to a wider range of functions, 
namely the trigonometric functions sin and cos, the exponential function 
exp and the natural logarithm function In. The results are incorporated 
into a table giving the derivatives of important basic functions, to which 
are added two simple rules for differentiating functions defined as sums or 
constant multiples of those in the table. The alternative Leibniz notation 
for derivatives is described. 


Section 4 gives three further rules for differentiation, which apply to 
functions that are defined as products, quotients and composites of the 
more basic functions considered earlier. The results in Sections 3 and 4 
permit the differentiation of a very wide range of functions. 


In Section 5 you will see how optimisation, based on differentiation, can be 
applied with the help of the computer, in the context of modelling various 
real-life problems. 


To study Subsection 1.2 you will need an audio cassette player and Audio 
Tape 3. 


In this section you will see how the idea of the gradient of a straight line 
can be extended to apply to any smooth graph. After an informal 
introduction in Subsection 1.1, you will see in Subsection 1.2 how to find 
the gradient for any quadratic graph. Subsection 1.3 formalises the process 
of finding the gradient for a curve, which is known as differentiation, and 
shows how this process can be applied to power functions. 


1.1 Gradients of curves 


A linear function which relates the dependent variable y to the 
independent variable x has a graph with equation of the form 


y=mrz-+ c. 


Here m is the gradient or slope of the graph, which represents the ratio of 
any change in y to the corresponding change in x. In other words, we have 
change in y € ~) 


SS —- 
corresponding change in x 


rub 


as illustrated in Figure 1.1, for m > 0. 


Figure 1.1 Gradient m of a linear function 


The graph of a linear function is equally steep at all points and so has 
constant gradient. This is not true for the graphs of non-linear functions. 
For example, the graph of the function f(x) = 2? is ‘flat’ at the origin and 
becomes steeper as you move away from the origin to the left or right, as 
shown in Figure 1.2. 


In general, we need to talk about the gradient or slope of a graph at a 
point on the graph. This is the gradient of the unique line that ‘touches’ 
the graph at the point, as shown in Figure 1.3(a) overleaf. This line, the 
one which most closely approximates the graph of y = f(a) near the point, 
is called the tangent to the graph at the given point. 


Linear functions were 
introduced in Chapter A3, 
Subsection 1.2. The equations 
of lines and the definition of 
the slope or gradient of a line 
were given in Chapter A2, 
Subsection 1.1. 


If y decreases as x increases, 
then m is negative. If y does 
not change as x increases, 
then m = 0. 


Figure 1.2 Graph of y = x? 


The word ‘tangent’ comes 
from the Latin verb tangere, 
meaning to touch. 


CHAPTER C1 DIFFERENTIATION AND MODELLING — 


One way to visualise the 
tangent at a point on the 
graph in Figure 1.3(a) is as 
follows. ‘Think of 
progressively ‘zooming in’ on 
the portion of the graph 
which contains the particular 
point. This causes the 
portion of graph focused on 
to look straighter and 
straighter. Eventually it will 
look indistinguishable from a 
line segment, which is part of 
the tangent to the graph 
through the point. 


tangent 


(a) 


(b) No tangent to the graph 


Figure 1.3 
at (0, 0) 


(a) Tangent to a graph at a point 


You will see in this chapter that, for many functions f, the graph of f has 
a tangent at every point. Informally, a function is said to be smooth if it 
has this property. Any linear function is certainly smooth, since the 
tangent at each point coincides with the original straight-line graph. On 
the other hand, the function f(x) = |x| is not smooth. The graph of this 
function is shown in Figure 1.3(b). Zooming in on the point (0,0) will 
always give a V-shape for the graph rather than a line segment. Hence 
there is no tangent to the graph at this point. 


The gradients of tangents can often be expressed by a formula related to 
the rule of the function f. Differentiation is the process of finding this 
formula for all the tangent gradients, starting from the rule of the original 
function. 


For the line of Figure 1.1, you can think of the gradient as a measure of 
how fast variable y is changing as variable x changes: a large gradient 
corresponds to a rapid change of y with respect to x7, and a small gradient 
corresponds to a slow change of y with respect to x. The gradient of the 
line represents the rate of change of y with respect to zx. 


Similar remarks apply for graphs which are curved, such as the one in 
Figure 1.3(a), though here the rate of change of y with respect to x, being 
equal to the gradient at each point, varies as the point moves along the 
curve. The ‘rate of change’ relationship between variables is important in 
applications, and it is differentiation which enables us to find this rate of 
change at any point. You will see shortly how this process is carried out. 


Differentiation: informal description 


Differentiation is a process which enables you to find 


© the gradient of a graph; 


© the rate at which one variable changes with respect to another. 


1.2 Gradients of quadratic graphs 


In the audio tape that follows, you will see that all quadratic functions are 
smooth and how to calculate the gradient at any point on any quadratic 
graph. 


Now listen to Audio Tape 3, Band 1, ‘Gradients of quadratic graphs’. 


Frame 1 


The graph of y = x? 


——---— >= . 
| | (2, 4) Chord line segment 
| | joining two points 
| | on a curve: 
L-—-\-— —- — — +—----—--— : 
| | gradient = —. 
. run 
I l 
l r | 
; na 7 | For BD, 
\ 
| | a 
| | run ie 
I I ; 
Sa {ae tangent at B 
B’\ 
I | 
I | 
| : 
ar What is the gradient 

of the tangent 
=. = O 1 2 x ee 


Gradient 


Frame 2 


Gradient of tangent at (1, 1) 


“< 


Gradient of chord with endpoints 


(1,1) and (1+ h,(1+ h)*) is 


rise _ (1+h) —1 


(14h, (1 +h)*) 
run h 


142h4+H-1 
h 


_ oe Z 


EEA 


\ 


2+h. 


When his small: 


) 


) gradient of chord is close to gradient 
of tangent at (1,1); 


© 2+ his close to 2. 


5o gradient of tangent at (1,1) is 2. 


Frame 3° 


Gradient formula for f(x) = x 


Gradient of y = f(x) at (x, f(x)): 
P(x pe 26 


gradient of tangent 
at (x, f(x)) 


Frame 4 


Gradient formula for f (x) = ax’ + bK +¢ 


gradient: 2ax gradient: b gradient: O 


Gradient of y = f(x) at (x, f(x)): General quadratic 


gradient formula 


f(x) = 2ax+ Bb. 


Example 
fer fit os es 
Find the gradient of y= f (x) at F Qand RK. 


fie = a= 9 
Gradient at P: OxO-5 
Gradient at Q: (6 i) -~5 
Gradient at R: (6 x2)-5 = ] 


n Il 
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In Frame 4 of the audio tape it was stated that the gradient formula for a 
general quadratic function is given as: 


if f(z) = ax’ + bz +c, then f'(z) = 2axr +4 b, ee5 


where a, 6 and c are constants. This gives the value of the gradient of the 
graph of y = f(x) at the point (x, f(x)), which is also the rate of change of 
y with respect to x at this point. 


The following activities ask you to apply the quadratic gradient formula 
from result (1.1). 


Use result (1.1) to write down an expression for f’(x) in each of the 
following cases. 


(a) f(x) =2°+2+7 
(b) f(z) =5a* —x 
(ce) F(z) =5 + 32 — <2? 


Solutions are given on page 54. 


Activity 1.2. Finding gr 


Use result (1.1) to find the gradient at the point given on the graph of 
y = f(x) in each of the following cases. 


(a) f(z) =<, (5,3) 
(b) f(z) =—a*+1, (3,-8) 
(c) f(x) = 32? + 32x — 242, (1, -207) 


Solutions are given on page 54. 


As mentioned on the audio 
tape, f’(x) is pronounced as 
‘f dash of x’ or ‘f prime of x’. 


In Frame 2 of the audio tape you saw why the gradient of the function 
f(x) = x? at the point B(1,1) is 2, and it was asserted in Frame 3 that the 
gradient at a general point (x, f(a)) is given by 


fle) = 22. (1.2) 
We now justify this assertion, using reasoning similar to that in Frame 2. 


Figure 1.4 shows the graph of y = x”, with a particular point (z, 27) 
marked and a chord with one endpoint at (x, x”). 


7] 
(x +h, (a + h)*) 
l 
l 
| 
l 
chord : 
l 
| 
l 
l 
l 
l 
l 
l 
l 
Cg) es 
2 
yr 
O x a+h . 


Figure 1.4 Finding the gradient for y = r” 
at a general point on its graph 


In this argument, regard z as The other endpoint of the chord is at (x +h,(a+h)*), a nearby point on 
fixed while h varies. ‘The the graph. For small values of h, the gradient of the chord is close to the 


quantity h can be positive or gradient of the tangent at (2,27). The gradient of the chord is 


negative, but not zero. 
rise (235 =e 


run (x#+h)—-2 

g? + Qh + h? — x? 
h 

2rh + h? 


h 
= 27+ h. 


Now the smaller h becomes, the closer the chord gradient is to the tangent 
gradient, and the closer 27 + h is to 2%, so we conclude that the gradient 
of the tangent at (x, x”) is 2x, which confirms equation (1.2). 


The argument to establish the gradient formula for the general quadratic 
function f(x) = ax? + bx +c, as given by result (1.1), is very similar to 
that just employed to verify formula (1.2) for the special case where a = 1, 
b = 0 and c= 0. Moreover, the approach adopted here, of first evaluating 
the gradient of a chord from (z, f(a)) to (a +h, f(a +h)) and then ‘taking 
the limit’ as h tends to zero, can be applied much more generally. This is 
taken up in the next subsection. 


1.3 Differentiation of a function 


A formula for the gradient of the graph of a general function can be found 
by applying the same approach as was used at the end of Subsection 1.2 
for the particular case f(x) = x”. 


Suppose that f is any smooth function, so the tangent to its graph exists 
at all points. Figure 1.5 shows part of this graph, with a chord marked 
between the two points (x, f(a)) and (x +h, f(a +h)). 


(x +h, f(a + h)) 


tangent 


at (x, f(x)) 


Figure 1.5 Finding the gradient for y = f(«) at a general 
point on its graph 


This chord has gradient 
rise _ f(w@+h) = fle) 


run h 
Since f is a smooth function, its graph has a tangent at (x, f(x)), and the 
gradient of this tangent, denoted by f’(z), must be the limit of the chord 
gradient values as h tends to zero. In other words, we have 

/ —" fet h)= f(x) 

2) = jm (H+ =H) 49) 
We call f’(x), as defined by this formula, the derivative of f at the 
point z. For example, for the function f(2) = x7, where as you have seen 
f'(z) = 2a, we have: 
© the derivative of f at 1 is f’(1) = 2; 
© the derivative of f at 0 is f’(0) =0. 


In general, the derivative f’(2) depends both on the function f and on the 
point zx. Hence the derivative defines a function, called the derived 
function of f and denoted by f’. For example, for the function f(x) = 2”, 
the derived function f’ is defined by f’(a) = 2a. | 


The process of finding the derived function or derivative of a given 
function f is called differentiation. 


Once again, regard «x as fixed 
while A varies. The quantity 
h can be positive or negative, 


but not zero. 


The symbol fim is read as 


i 
‘the limit as A tends to 0’. 


Often the word ‘derivative’ is 


also used to refer to the 
derived function f’. 


Differentiation, derivative and derived function 
Let f be a function. 


© The derivative f'(x) at a point x in the domain of f is the 
gradient of the graph of f at (x, f(x)), given by 


/ cee * i@ er h) me yf ) 
f'(x) = lim ae) , (1.3) 
provided that this limit exists. 


If the derivative exists at each point of the domain of f, then we 
say that f is smooth. 


The process of finding the derivative f’(x) is called 


The verb associated with differentiation. 
se aimee _ The function f’ defined by the process of differentiation is called 
differentiate’. The meaning of : ‘ 

. the derived function of f/f. 
this verb in calculus is 
therefore not the same as its If y = f(x), then f’(x) is the rate of change of y with respect 
meaning in everyday English to. 


(where it means ‘to 
distinguish between’ or ‘to 


make different’). A word on domains is in order. The main emphasis in this course is on 


finding the rule for the derived function, f’, of f. However, f’ must also 
have a domain and this consists of those points in the domain of f where 
the derivative exists. In particular, if f is a smooth function, then the 
domain of f’ is the same as that of f. 


Depending on the particular purpose which you have in differentiating, you 
can regard the derivative as giving the gradient of the graph of the original 
function or the rate of change of the dependent variable with respect to the 
independent variable. Both interpretations are useful and widely applied. 


So far you have seen how to differentiate quadratic functions. ‘There are 
many types of smooth functions. In Sections 3 and 4 you will see the 
derived functions for a range of basic functions, together with some rules 
for finding derived functions for a wide class of functions. 


Other than in this section, we shall not ask you to spend time in justifying 
these various results formally, since this is of secondary importance 
compared to being able to apply them appropriately. Such justifications 
involve starting in each case from definition (1.3), and we now look briefly 
at what this involves in a few simple cases. 


nple 1.1 Differentiation from the definition 


Using this definition directly | Use equation (1.3) to differentiate the function f(x) = 2°. 
is also called ‘differentiating ; 
from first principles’. Solution 


By equation (1.3), we need to consider the quotient 


f(a+h)—f(z) (a+h)—-2° 


h h 


where hf is a non-zero number. 


Now we have 
(x +h)? =(x+h)(x+ h)? 
=(x+h)(x* + 22h+ h?) 
ao + 32°h oan ek. 


so the quotient becomes 


pa + hh) — f(x) = eae ee Sale +h” x" Since h # 0, the possibility of 
h h division by zero does not 
S 327*h + 3@h? + h? arise. 
ie ee 


= 377 + 32h +h’. 


For small values of h, the expression 3x? + 3xh + h? is close to 3x7 because 
both 3xh and h? are small. Thus in the limit as h — 0, we obtain 


fia) = 3x’. 


The result of Example 1.1 can be applied to give the gradient at any point 
on the graph of y = x°. For example, the points (1,1) and (—1, —1) both 
lie on y = x°, and the gradients at these points are respectively 


oto and jf (—1) = 3. 


The tangents at these points are shown in Figure 1.6, together with the 
tangent to y = x? at (1,1), which has gradient 2, for comparison. 


\ y [= ; 
; , (gradient 2 


oo 
\ 
\ 
\ l ay) 
\ 
\ 
: y 
— : gradient 3 
sr oe 
N Pas 
—1 0 1 . 
ai, = —] 


gradient 3 


Figure 1.6 Gradients of tangents to the graphs of y = 2° 
and y = x7 


Note that if x = 5, then 
y = Gy 2 6: Hence the 
point (5, =z) lies on the 


Recall that x° = 1 
and x! = a. 


The domain of a power 
function depends on the value 
of n. If n is a non-negative 
integer, then the domain is R. 


Recall that 1/2° = z~° and 


Je = xil?, 


This derived function gives 
the instantaneous velocity of 
the car, which is what would 
be recorded at each time ¢ on 
its speedometer. 
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Activity 1.3 Differentiation from the definition 


(a) Use the expansion for (2 + h)* from Example 1.1 to find the expansion 
for (29 +h)* = (a+h)(a +h)’. 

(b) Use equation (1.3) and the result of part (a) to differentiate the 
immction f( 7) =2" 

(c) What is the gradient of the graph of y = x* at the point (5,4)? 

Solutions are given on page 54. 


You have now seen it established that: 

© if f(x) = 2’, then f’(x) = 2x (Subsection 1.2); 

© if f(z) = 2°, then f’(x) = 32° (Example 1.1); 

© _if f(a) = 2*, then f'(e)= ae" (Acti 2 

In addition, you know that if f(a) = x then f’(x) = 1, since the line y = x 
has gradient 1, and that if f(~) = 1 then f’(x) = 0 because the line y = 1 


is horizontal and hence has gradient zero. These results can be 


summarised in the single statement 
if f(z) = 2", then f'(a) =a" > tae 4. 


In fact, this statement holds not just for these particular values but for any 
real value of n, so we can write 


if f(c) = 2", tha FG) ane, 


where n is any real number. Functions of the form f(z) = x” are called 
power functions, so result (1.4) gives the rule for differentiating any 
power function. This result is often expressed as 


(1.4) 


the derivative of x” is nx”. 


Activity 1.4 Differentiation from a formula 


Use result (1.4) to differentiate each of the following functions. 


(a) f(z)=2° = (b) f(z)=1/ae* (ce) fF(a)= ve (e>0) 


Solutions are given on page 54. 


The concept of differentiation applies equally to situations where variables 
other than x and y are involved. In the course of mathematical modelling 

it will often be the case that other variable labels occur. For example, if s 
(in metres) is the position of a car moving along a straight road, measured 
from a fixed starting point, and t (in seconds) is the time since leaving the 
starting point, then the particular motion of the car may be described by 

s = f(t) for some smooth function f. The derivative of this function gives 
the rate of change of the car’s position with time, which is its velocity 

v = f'(t), measured in metres per second (ms~‘). 


In fact, this example permits a physical interpretation of the 
definition (1.3) for the derivative. If s = f(t), then 


f(t +h) —f®) 
h 


is the average velocity (distance travelled divided by time passed) over the 
interval between time ¢t and time t+ h (assuming h > 0). The 
instantaneous velocity v = f’(t) at time t¢ is the limit of such average 
velocities as the duration h of the time interval tends to zero. 


Example 12 | 


A car moving along a straight road has position s = 30t — t? metres, as 
measured in a given direction from a fixed point O, at time t seconds after 
it passes O. 


(a) Find the corresponding velocity v (in ms~') of the car at time t. 
(b) What is the velocity of the car as it passes O? 


(c) At what time does the car have velocity zero? 


Solution 


(a) In this case the position function is s = f(t) = 30t —t?. This isa 
quadratic function (of t), so we may apply result (1.1) to obtain the 
velocity function 


y= f(t) = 30 — 2t. 


(b) The car passes O when t = 0, at which time we have v = f’(0) = 30, so 


the velocity is then 30 ms“. 


(c) When v = 0, we have 30 — 2¢ = 0, which has solution ¢ = 15. Hence 
the car has velocity zero (is at rest, momentarily) after 15 seconds. 


A stone dropped from rest has position s = 5t? metres below its starting 
point after t seconds. 


(a) Find the velocity v (in ms~*) of the stone at time t. 
(b) What is the velocity of the stone after 3 seconds? 
(c) At what time does the stone have velocity 15 ms~'? 


Solutions are given on page 54. 


For t > 15 the values of v are 
negative, so the car then 
moves back towards O. 
However, the model may be 
intended to apply only 


noe <at < 15. 


Summary of Section 1 


This section has introduced: 


© differentiation (informally) as a process which enables you to find 
either the gradient of a graph, or the rate at which one variable 
changes with respect to another; 


© differentiation of a function f (formally) as the process of finding the 
derived function f’ and derivative f’(x), as defined by 


f'(0) = jim (A241), (1.3) 


© the application of differentiation to quadratic functions and to power 
functions f(x) = x”, where n is a real number. 


Exercises for Section 1 


Exercise 1.1 
Use result (1.1) to find the gradient of the graph of y = 6 + 5x — Ax* at the 
point (2,0). 
Exercise 1.2 
(a) Show that if #0 andx+h #0, then 
1 1 h 


ct+h «x (x+h)a 


(b) Use equation (1.3) and the result of part (a) to differentiate the 
function 


(c) Verify that the result of part (b) can also be obtained by applying 
result (1.4). 


Exercise 1.3 


An aeroplane accelerating along a runway before take-off has travelled a 
distance s = 3t? +t metres from the start of the runway after t seconds. 


(a) Find the corresponding velocity v (in ms~*) of the aeroplane at time t. 
(b) What is the velocity of the aeroplane after 10 seconds? 


(c) At what time does the aeroplane have velocity 25 ms~*? 


In Section 1 the process of differentiation was introduced. You saw how 
this process could be applied in certain cases to find formulas for the 
derivatives of functions, and so obtain values of the gradients at points on 
the corresponding graphs. 


A major application of differentiation is in finding an optimum value of a 
function, that is, a highest or lowest value. In a modelling context this 
may represent the ‘best’ way of achieving some practical outcome, subject 
to certain constraints. For example, it might correspond to a maximisation 
of profit or a minimisation of resource expenditure. 


Optimisation is discussed in Subsection 2.3. In preparation for this topic, 
Subsection 2.2 shows how differentiation can assist in understanding the 
structure of the graph of a given function, which is also useful in its own 
right. The examples here are based on simple polynomial functions. 
Subsection 2.1 shows how to find the derivatives of such functions. 


2.1 Differentiating polynomial functions 
A polynomial function of degree n is a function whose rule has the form 

= @.2 14,12" "+--+ aor" + ae + G, (2.1) 
where a,, #0. This includes the cases of linear functions (n = 1), quadratic 
functions (n = 2) and cubic functions (n = 3). For each of the first two of 
these cases, you saw in Section 1 how the corresponding derivative is 
obtained. Thus 

f(x) = a,x + ap has derivative f’(x) = a, 
and 

f(x) = agx? + a,x + ao has derivative f’(x) = 2agx + ay. 
For the general polynomial function given by equation (2.1), the derivative 
is 

f'(x) = na,z”* + (n — 1)ap_1e™ 7 +--+ + Qagx t+ ay. 


This result can be written concisely, using sigma notation, as 


if f(x = Dae" then f(x = Dia" = (2:2) 


Note, from result (1.4) for the derivative of x”, that result (2.2) is 
equivalent to the statement 


if f(x => a;x’, then f'(x 9 a;(derivative of x’), (28) 


jax) 


which is an alternative way to remember the result. 


Polynomial functions were 
introduced in Chapter A3, 
Section 5. 


The line y = a,x + ao has 
gradient a,. The derivative of 
a quadratic function was 
given, with different labels for 
the coefficients, in result (1.1). 


Sigma notation was 
introduced in Chapter B1, 
Subsection 1.2. 


The lower limit in the 
summation for f’(a) in 
result (2.2) can be written as 
4 = 1 rather than 7 = 0, since 
the value of the term ia;x*~! 
for 1 = 0 is zero. 


Comparing 

f(x) = 22° — 9x* + 3x with 
equation (2.1), we have n = 6, 
a, = 3, a4 = —9, ag = 2 and 
= Go == ds = a, = 0. 


A polynomial function of 
degree 4 is called a quartic 
function. 


Differentiate the polynomial function f(x) = 2x° — 9x* + 3x. 


Solution 


From result (1.4), the derivative of x° is 62°, the derivative of x* is 42° and 
the derivative of x (= x') is 1 (= x°). From result (2.3), the derivative of 
the given polynomial function is 


f'(x) = 2(derivative of x°) — 9(derivative of x*) + 3(derivative of x) 
= 2(6x°) — 9(4x°) + 3(1) 
12s? = Bie eds 


(a) Differentiate the polynomial function f(x) = 6a* — 32°. 


(b) What is the gradient of the graph of y = —g? +52? —7x+15 at the 
point (1,12)? 3 


Solutions are given on page 94. 


2.2 Features of graphs 


In this subsection you will meet methods for identifying various key 
features of the graphs of functions. The examples involve polynomial 
functions, for which the derivative formula was given in the previous 
subsection. However, the degree of the polynomial function in each case 
will be no higher than 4. 


Several useful results will be stated. Note that these results apply to any 
smooth function, and not just to the types of polynomial function 
considered in the examples. 


We look first at the graph of the cubic function 
f(x) = 22° + 3x* — 122 — 4, 


which is shown in Figure 2.1. 


Figure 2.1 Graph of y = 2x? + 3x2 — 12x” —4 


The labels in Figure 2.1 refer to the following important features of this 
graph. 
(a) the graph crosses the x-axis at three points, approximately at 

Pee 3.2, 2 => —0.3 and x = 2.0; 
b) the graph crosses the y-axis at one point, where y = —4; 
c) the graph has a ‘peak’ at approximately (—2, 16); 
) 


( 
( 
(d) the graph has a ‘trough’ at approximately (1, —11); 
(e) to the left of the peak, the graph rises; 

(f) between the peak and the trough, the graph falls; 
(g) to the right of the trough, the graph rises. 


We now consider each of features (a)—(d), in turn. 


The z-values where the graph crosses the z-axis are the solutions of the 
equation 

f(x) = 0; 
This is a cubic equation, and in general such equations cannot be solved 
straightforwardly by hand. 


Mage 26° 4 32° — 127 —4=0. 


The y-value where the graph crosses the y-axis is f(0) = —4, which is the 
constant term in the polynomial. 


A peak such as the one near (—2,16) in Figure 2.1 is called a local 
maximum of the function because it gives the greatest function value in 
the immediate vicinity of the point (the word ‘local’ is needed because the 
function may take greater values elsewhere). Similarly, the trough on the 
graph near (1,—11) is called a local minimum. 


Note that this and some other 
graphs in this chapter are 
drawn with unequal scales on 
the two axes. 


In (e)-(g), and in similar 
statements throughout this 
subsection, it is assumed that 
the graph is traversed from 
left to right. 


However, in this particular 
case, it turns out that x = 2 
is a solution, from which the 
other two solutions can also 
be found if needed. 


The plural of ‘maximum’ is 
‘maxima’, and the plural of 
‘minimum’ is ‘minima’. 


As you will see, not all 
stationary points are local 
maxima or minima. 


More precisely, there is a 
local maximum at x2 = %o if 
there is some open interval J, 
containing x9, such that 

f(x) < f(%o) for any x F xo 
in J. A similar statement can 
be made for a local minimum, 
with the inequality replaced 


by f(x) > f(Zo). 


In general, at local maxima and minima, the gradient is zero. In other 
words, the tangent at a local maximum or minimum is horizontal, as 


shown in Figure 2.2. 


local 
maximum 


local 
minimum 


Figure 2.2 Horizontal tangents at local maximum 
and minimum 


Thus the local maxima and minima can be found by identifying those 
values of x for which f’(2) = 0, which are known as the stationary points 
of the function f. 


Stationary point, local maximum and local minimum 
Let f be a smooth function. 


© The function f has a stationary point at © = Zo if f’(x) = 0. 
The corresponding point (x9, f(xo9)) on the graph of f is also 
called a stationary point. 


The function f has a local maximum at a stationary point 


x = Xo if f(x) is greater than any other function value in the 
immediate vicinity of x9. The corresponding point (Xo, f(xo)) on 
the graph of f is also called a local maximum. 


The function f has a local minimum at a stationary point 

x = Lo if f(x) is less than any other function value in the 
immediate vicinity of x9. The corresponding point (2, f(xo)) on 
the graph of f is also called a local minimum. 


(a) Differentiate the function 


f(x) = 22° + 32? — 122 — 4, 


whose graph was given in Figure 2.1. 


(b) By solving the equation f’(x) = 0, find the stationary points of f. 


Solution 


(a) From results (1.4) and (2.3), the derivative of the given polynomial 
function is 
eee) 3 (22) — 12(1) 
=e oe 72 
= 6(47 + x — 2). 
(b) The equation f’(x) = 0 is equivalent to 


fee thai is, (7+ 2)\(o— 1) =0. 
This equation has solutions x = —2 and x = 1. Thus the stationary 
points of f are at x = —2 and x = 1. 
Comment 
By substituting the values x = —2 and x = 1 in turn into the rule for f, we 


find that f(—2) = 16 and f(1) = —11. In fact, the two stationary points 
correspond to the local maximum (—2,16) and the local minimum (1, —11). 
This confirms the values given for features (c) and (d) following Figure 2.1. 


Activity 2.2 Finding stationary points ee =e 


(a) Differentiate the function 
f(x) = x° — 62" — 152 + 54. 
(b) By solving the equation f’(x) = 0, find the stationary points of f. 


Solutions are given on page 55. 


Next we discuss the three features (e), (f) and (g) of the graph in 
Figure 2.1. These relate to the parts of the graph which rise or fall. To 
describe these features more formally, it is convenient to extend the 
concept of an increasing (or decreasing) function, as follows. 


Suppose that f is a real function and that the interval J is in the domain 
of f. Then f is said to be increasing on / if 


for all @y, 7. ind, 194 @,, then f(2,) < f (x9), 
and f is said to be decreasing on / if 
for all 71,0 indy wi < Bo, then f(e,) > Fes 


For example, the function f(x) = x? is increasing on the interval (0, oo) 
and decreasing on the interval (—oo,0); see Figure 2.3 overleaf. 


Note that, in order to answer 
this type of question, it 
suffices to find the 
x-coordinate of each 
stationary point, unless you 
are explicitly asked to 
calculate also the 
corresponding function value. 


The concepts of ‘increasing’ 
and ‘decreasing’ were 
introduced in Chapter A3, 
Subsection 4.1, where they 
applied to the whole domain 
of a function. 


Note that in this criterion J is 
an open interval. 


In this display the common 
practice of writing ‘for x in 
.«. +0 taean “for all ¢ in-... 
has been used. This practice 
will be followed except in 
statements of definitions and 
results. 


) 


increasing 
on (0, co) 


decreasing 
on (—oo, 0) 


Figure 2.3 Intervals on which f(x) = 2? is 
increasing /decreasing 


We can often determine intervals on which a function f is increasing or 
decreasing by using differentiation. This is so because the derivative f’(x) 
gives the gradient of the graph at the point (x, f(x)), and the sign of the 
gradient is related to whether the function is increasing or decreasing. 


Increasing /Decreasing Criterion 
Let J be an open interval in the domain of a smooth function f. 


© If f’(x) > 0 for all x in J, then f is increasing on I. 
© If f(x) < 0 for all x in J, then f is decreasing on I. 


For example, consider the function f(x) = «* whose graph is shown in 
Figure 2.3. In this case 
; 
ames Fie > 0 fore in (0, o0], 
il a <0 for x in (—oo,0). 
This corresponds to the increasing and decreasing properties of f shown in 
Figure 2.3, in accordance with the Increasing/Decreasing Criterion. 


Example 2.3 Finding where a function is increasing/decreasing — 
In Example 2.2 you saw that the function f(x) = 27° + 3x27 — 12x — 4 has 
derivative 

f'(x) = 6(2* + x — 2) = 6(x + 2)(z — 1). 
(a) Show that 


f'(z) >0 for x in (—oo, —2), 
re <0 for x in (—2,1), 
f(z) >0 fore im(sse). 


(b) Use the Increasing/Decreasing Criterion to deduce from part (a) 
intervals on which f is increasing or decreasing. 


Solution 
(a) For x < —2, we have 
£+2<0 and «r-1<-3< 40, 


so f'(x) = 6(@ + 2)(a — 1) > 0 (since f(z) is the product of 6 and of 
two negative numbers). 


For —2 < x < 1, we have 
£+2>0 and «-1<4J, 


so f'(x) < 0 (since f’(x) is the product of 6, one other positive number 
and one negative number). 


For z > 1, we have 
t+2>37->0. and 2+=— 1 > 0, 


so f'(x) = 6(a + 2)(a — 1) > 0 (since f’(x) is the product of 6 and of 
two other positive numbers). 

(b) By the Increasing/Decreasing Criterion, f is increasing on (—oo, —2) 
and on (1,00), but decreasing on (—2,1). 


Activity 2.3 Finding where a function is increasing/ decreasin, 7 ] 
In Activity 2.2 you showed that the function f(x) = x? — 6x? — 15a + 54 
has derivative 

f (x) = 327 — 122 — 15 = 3(z + 1)(a — 5). 
(a) Show that 


f'(x) >0 for x in (—oo, -1), 
f'(2) <0. fore i{—1, 5), 
}' (a). > 0. forme dmyGjook 
(b) Use the Increasing/Decreasing Criterion to deduce from part (a) 
intervals on which f is increasing or decreasing. 


Solutions are given on page 55. 


Figure 2.5 Graph of y = x° 


The connection between derivatives and intervals on which a function is 
increasing or decreasing can be used to provide a useful test for classifying 
a stationary point as a local maximum or a local minimum. This is stated 
in the box below and illustrated in Figures 2.4 and 2.5. 


First Derivative Test 

Suppose that xp is a stationary point of a smooth function f; that is, 

f'tor=3 

© If f’(x) changes sign from positive to negative as x increases 
through xo, then f has a local maximum at Zo. 


© If f’(x) changes sign from negative to positive as x increases 
through xo, then f has a local minimum at Zo. 


If f’(a) does not change sign as x increases through Zo, then f 
has neither a local maximum nor a local minimum at Zo. 


Figure 2.4 (a) Local maximum _(b) Local minimum 


In the case of a local maximum, the function is increasing on an interval to 
the left of z) and decreasing on an interval to the right of 7. The reverse 
is true for a local minimum. 


Note, however, that a stationary point (at which f’(#) = 0) need not give 
rise to either a local maximum or local minimum, as demonstrated by the 
example of f(x) = x°, whose graph is shown in Figure 2.5. This function 
has a stationary point at 2 = 0, which is neither a local maximum nor a 
local minimum. In this case the function is increasing on all intervals to 
the left or right of = 0. This is confirmed by the sign of the derivative 
f'(x) = 3x", which is positive wherever x # 0. 


To determine whether a general function f is increasing or decreasing to 
the left or right of a stationary point %o, it is usually sufficient to check the 
sign of the derivative f’(x) at two points, x = x, to the left of rp and 

L = Lp to the right of xo, as specified in the following strategy. ‘The First 
Derivative Test can then be applied in the form given in the strategy. 


Strategy 
To identify the local maxima and local minima of a smooth 
function f. 
1. Find the stationary points of f. 
2. For each stationary point %o, 
(a) choose points xy, to the left of x) and xp to the right of zo, 
such that the interval [x,, 7p] lies in the domain of f and In this chapter we apply this 


there are no stationary points between z;, and x, nor between strategy only to polynomial 


qo and xp, and then calculate f’(x,) and f’(xR); functions. Hence the interval 
[xL, Rp] always lies in the 


(b) classify x9 as follows: ek off. 


© if f'(z_) >0 and f’(xR) < 0, then f has a local maximum 
at Lo; 

© if f’(a_) <0 and f’(xg) > 0, then f has a local minimum 
at Lo; 


if f’(x,) and f’(xp) have the same sign, then f has 
neither a local maximum nor a local minimum at Zo. 


Remarks 

1. For each stationary point x9, the points x; and xp described in 
Step 2(a) are referred to as ‘test points’. 

2. The choice for a pair of test points is not unique. We usually choose 
test points near x, that facilitate the calculation of the derivatives in 
Step 2(a). 

Before applying the complete strategy, here is some practice in choosing 

test points. 


In Example 2.2 it was found that the function 
f(x) = 22° + 327 — 122 —4 


has stationary points at x = —2 and x = 1. Determine a pair of test points 
for each of the stationary points. 


A solution is given on page 55. 


The complete strategy is illustrated in the following example. 


Use the strategy to apply the First Derivative Test to classify each of the Here ‘classify’ means ‘identify 
stationary points of the function as a local maximum, a local 
minimum, or neither’. 
fizajax=29 475 


There are no stationary 
points between —1 and 0, nor 
between 0 and 1. 


The notation f” is 
pronounced as ‘f double dash’ 
or ‘f double prime’. 


Third, fourth and 
higher-order derived functions 
and derivatives may be 
defined in a similar way. 
When more than one order of 
a derivative is involved, f’(x) 
may be called the first 
derivative (as in First 
Derivative Test). 


28 


CHAPTER CI DIFFERENTIATION AND MODELLING — 


Solution 
Step 1: The derivative of f(x) = 2° — 32° + 1 is 
f'(x) = 3x? — 6x = 32(x — 2). 
The equation 3x(a — 2) = 0 has solutions x = 0 and x = 2, so the 


stationary points of f are at x = 0 and x = 2. (These are the only 
stationary points of f.) 


Step 2: To classify the stationary point at x = 0, we choose test points 
tr, = —l and zp = 1, say. Then we have 


f'(ay) = f'(-1) = 3(-1)(-3) = 9 > 0, 
flea) - FOS) = 3 < 0, 
so f has a local maximum at x = 0, by the First Derivative ‘Test. 


To classify the stationary point at x = 2, we choose test points zy, = 1 and 
LR = 3, say. ‘Then we have 


f(a.) =f’) =-3 <0, 
f'(en) = f'(3) = 3(3)(1) =9 > 0. 


so f has a local minimum at x = 2, by the First Derivative Test. 


Activity 2.5 Applying the First Derivative Test 


Use the strategy to apply the First Derivative Test to classify each of the 
stationary points of the function 


f(x) = 3x* — 2a? — 9x? +7, 
noting that the derivative has a factor 2. 


A solution is given on page 55. 


There is another test that is sometimes useful for the purpose of classifying 
the stationary points of a function. This involves the second derivative of 
the function. 


Second derivatives 


When we differentiate a smooth function f, we obtain the derived 
function f’. Often this also is a smooth function, which can itself be 
differentiated. The result of doing so is the derived function of f’, which is 
denoted by f” and called the second derived function of the original 
function f. The value of f” at a given point is called the second 
derivative of f at that point. (Often the term ‘second derivative’ is also 
used to refer to the second derived function f”.) For example, if f(x) = 2’, 
then 


figj=2 wo Pies 


Since the derived function of any non-constant polynomial function is 
another polynomial function, results (1.4) and (2.3) can be used as before 
to find the second derivative of a polynomial function. 


You showed in Activity 2.1(b) that the function 
f(x) = -2* +52? — 7x +15 

has derivative 
f'(x) = —3x? + 102 — 7. 

Find the second derivative of the function f. 


A solution is given on page 55. 


Second derivatives, like first derivatives, frequently crop up in 

mathematical models. For example, you saw in Subsection 1.3 that if 

s = f(t) (in metres) is the position of a car moving along a straight road, 

measured from a fixed starting point, where t (in seconds) is the time since 

leaving the starting point, then the velocity of the car is represented by 

v = f’(t),in ms~’*. A further differentiation gives the acceleration of the The acceleration is the rate of 


car, a = f(t), measured in metres per second per second (ms~’). change of velocity v with 


CL ; aoe 4 ‘ respect to time f. 
The second derivative is used in a further test for classifying the stationary 


points of a function. To show briefly why this works, suppose that % isa __sIt is not essential to follow 
stationary point of a function f, so that f’(a)) = 0, and suppose also that the details in this paragraph. 
f’(xo) > 0. Then there will be some open interval J, with xp in I, such If you are short of time, then 
that f’(x) > 0 for x in I. It follows from the Increasing/Decreasing move straight to the boxed 
Criterion, applied to the function f’ rather than to f as originally stated, Statement below. 

that f’ is increasing on J. This means, since f’(2%)) = 0, that f’ changes 

sign from negative to positive as x increases through x9. Thus Zp is a local 

minimum, by the First Derivative Test. A similar argument shows that if 

f" (zo) < 0 at a stationary point 2, then 2p is a local maximum. 


Second Derivative Test 
Suppose that x9 is a stationary point of a smooth function /; that is, 
f' (xo) ce | 


© If f”(a) <0, then f has a local maximum at Zo. Note that this test gives no 
information if f(a) = 0. 


© If f’(ro) > 0, then f has a local minimum at Zp. 


One way of remembering the details of this test is indicated by the two 
‘faces’ shown in Figure 2.6. The depiction of the ‘nose’ on each face, by 
resembling the double prime notation in f”, is a reminder that the second 
derivative is involved. The ‘mouth’ indicates the shape of the graph close 


to the stationary point x , while the ‘eyes’ give the corresponding sign 
of f(x). Figure 2.6 ‘Faces’ to 


recall the Second 
Derivative ‘Test 
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at 0 


Figure 2.7 Graph of 
f(a) = x" 


1 z 


In Example 2.4 it was found that the function 


f(z) =a° —327+1 


has derivative 
f'(x) = 3a? — 6x = 3a(x — 2) 


and hence has stationary points at x = 0 and x = 2. Use the Second 
Derivative Test to classify each of these stationary points. 


Solution 
The second derivative of f is obtained by differentiating f’(x), to obtain 
f' (ae) = Ge — 6 = Ge 1). 


For the stationary point at x = 0, we have f”(0) = —6 < 0, so f has a local 
maximum at x = 0, by the Second Derivative Test. 


For the stationary point at x = 2, we have f”(2) =6 > 0, so f has a local 
minimum at x = 2, by the Second Derivative ‘Test. 
Comment 


As expected, these classification results agree with those found by applying 
the First Derivative ‘Test in Example 2.4. 


In Activity 2.5 you found that the function 
f(x) = 3a* — 22° — 92° +7 
has derivative 
f'(x) = 122° — 62° — 182 = 6x(22 — 3)(x4 + 1) 


and hence has stationary points at x = —1,x=QOandzr= 7 Use the 
Second Derivative Test to classify each of these stationary points. 


A solution is given on page 55. 


The solutions to Example 2.5 and Activity 2.7 suggest that it is easier to 
use the Second Derivative Test than the First Derivative Test when 
classifying stationary points, despite the additional differentiation required. 
Indeed, it is frequently easier to use the Second Derivative Test for 
polynomial functions, but you should be aware that for certain functions 
this test cannot be applied. For example, the function f(x) = x*, whose 
graph is shown in Figure 2.7, has a stationary point at x = 0 which is a 
local minimum. However, in this case f’(r) = 42°, so f” (a) = 12x? and 
hence f”(0) = 0. This means that the Second Derivative Test cannot be 
applied. On the other hand, the First Derivative ‘Test can be applied. In 
the notation of the strategy (page 27), if x, = —1 and rp = 1, say, we have 


f'(a,) = f'(-1)=-4<0 and f'(ep) =f’) =4>0, 


confirming the presence of a local minimum. For this reason, it is 
important to be able to use both tests. 


Sketching the graph 


Once the stationary points of a function have been classified, it is often 
possible to produce a reasonable sketch of the graph of the function. For 
example, the function 


f(x) = 2° — 327 +1 


has a local maximum at x = 0 and a local minimum at x = 2. Since we 
have f(0) = 1 and f(2) = —3, the shape of the graph near the points (0, 1) 
and (2, —3) can be indicated as in Figure 2.8(a). Because there are no 
other stationary points, f must be increasing on (—oco,0), decreasing on 
(0,2) and increasing on (2,00), so we can complete a rough sketch of the 
graph as shown in Figure 2.8(b). Note that for such a rough sketch it is 
not necessary to calculate or estimate the values of the x-intercepts, but 
the sketch should show the y-intercept. | 


f= —oe 1 


Figure 2.8 Sketching the graph of y = 2° — 3x7 + 1 


Though rough, this sketch gives a good impression of the overall shape of 
the graph. In the next activity, you are invited to produce a similar rough 
sketch. 


_ Activity 2.8 Sketching a graph 


The stationary points of the function 
f(x) = 3z* — 22° — 9a? + 7, 


were found and classified in Activities 2.5 and 2.7. They are at x = —1 
(local minimum), x = 0 (local maximum) and x = $ (local minimum). 
Find the y-coordinates of each of these points on the graph, and hence 
draw a rough sketch of the graph of this function. 


A solution is given on page 56. 


The stationary points of this 
function were found and 
classified in Examples 2.4 
and 2.5. 


Here the y-intercept happens 
to correspond to a stationary 
point. 
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All the functions considered as examples in this section are polynomial 
functions. As was pointed out earlier, all the boxed results in this 
subsection apply more generally, to any smooth function f. The First and 
Second Derivative ‘Tests can be used to classify the stationary points of 
many non-polynomial functions, and the classification can be used to help 
sketch graphs of the functions. However, this only becomes a practical 
proposition once you know how to differentiate non-polynomial functions. 
Sections 3 and 4 will widen considerably the range of functions that you 
can differentiate. 


2.3 Optimisation 


In many modelling situations, it is required to find the greatest or least 
value attained by a function f as the independent variable ranges over a 
particular interval (arising from constraints within the model). Two such 
situations are illustrated in Figure 2.9, where the interval is J = [a,b]. 


Figure 2.9 Greatest and least values 


In Figure 2.9(a), the greatest value of f is attained at B and the least 
value is attained at A. Both A and B correspond to endpoints of the 
interval. In Figure 2.9(b), however, the greatest value of f is attained at an 
endpoint (B again) but the least value is attained at D, a local minimum. 


The greatest or least values attained by a function on an interval are called 
the optimum values on that interval, and the process of finding these 
values is called optimisation. Since optimum values can occur only at 
endpoints or at local maxima or minima, the following procedure can be 
relied on to find the optimum values and where they occur. 


SECTION 2. GRAPHS AND OPTIMISATION. 


Optimisation Procedure 

To find the greatest (or least) value of a smooth function f on a 

closed interval J within the domain of f, proceed as follows. 
Find the stationary points of f. 


Evaluate f at each of the endpoints of J and at each of the Note that it is not essential 


stationary points inside I. here to classify the stationary 
points. 


Choose the greatest (or least) of the function values found in 
Step 2. 


The following example illustrates the use of this procedure. 


Example 2.6 Finding optimum values” 
Find the greatest and least values of the function 
f(x) = —2x° + 3x? + 72x 
on the interval J = [—5, 5]. 


Solution 
We follow the steps of the Optimisation Procedure. 
Step 1: ‘The derivative of the given function is 
f'(z) = —62? + 62 + 72 = —6(2? — x — 12) = —6(x + 3)(x — 4), 
so the stationary points are at x = —3 and x = 4. 


Step 2: At the endpoints of the interval J = |—5, 5], the function values 
are 


f(—5) = 250 + 75 — 360 = —35 and f (5) = —290 + 75 + 360 = 185, 


while at the stationary points (which are both inside J) the function values 
are 


f(—3) = 54+ 27 —- 216 =—-1385 and f(4) = —128+4+ 48 + 288 = 208. 


Step 8: 'The greatest value of f(x) on [—5,5] is 208 (at 2 = 4), and the If J had been [—2, 3], then 

least value of f(x) on [—5, 5] is —135 (at « = —3). both stationary points found 

aa a ce re ee a ee ee ) wild be antsice J. 
In this case, the greatest and 
least values of f(x) on I are 

seine §8—f (3) — 189 anid 

Activity 2.9 Finding optimum values | | f(—2) = -116, respectively. 


In each of the following cases, find the greatest and least values of the 
function f on the given interval J. 


ay fiz) = gr) — e—8e+1 I={-3,5] 
b) Pe te? eg ret 5 


Solutions are given on page 56. 


28 


Summary of Section 2 


This section has introduced: 
© the formula for obtaining the derivative of a polynomial function; 


© the stationary points of a function f, being the points where f’(x) = 0, 
including the cases of a local maximum and a local minimum; 


© the Increasing/Decreasing Criterion, which uses the sign of f’(x) to 
determine whether f is increasing or decreasing on an interval; 


© the First Derivative Test, and an associated strategy, for classifying 
each stationary point of a smooth function as a local maximum, a 
local minimum, or neither of these; 


© the Second Derivative Test for classifying each stationary point of a 
smooth function (where applicable) as a local maximum or a local 
minimum; 

© use of the location and classification of stationary points in sketching 
the graph of a smooth function; 


© the Optimisation Procedure for finding the optimum (greatest or least) 
value of a smooth function on a closed interval. 


Exercises for Section 2 


Exercise 2.1 
(a) Differentiate the polynomial function f(z) = 7x'°! — 132°° + a. 
(b) What is the gradient of the graph of y = 52° + 42° — 9x — 8 at the 
point (—1,0)? 
Exercise 2.2 
(a) Find the stationary points of the function 
f(x) = —a? — 3x7 + 9a + 5. 
(b) Classify each of the stationary points found in part (a), 
(i) using the strategy to apply the First Derivative Test; 
(ii) using the Second Derivative Test. 


(c) Find the y-coordinates of each of the stationary points on the graph, 
and evaluate f(0). Hence draw a rough sketch of the graph of this 
function. 


Exercise 2.3 
In Activity 2.2 you found that the function 
f(x)= 2° — 6x? — 152+ 54 
has derivative 
f'(x) = 3a? — 122 — 15 = 3(a + 1)(x — 5) 
and hence has stationary points at x = —1 and z = 5. 


(a) Use the Second Derivative Test to classify each of these stationary 
points. 

(b) Find the greatest and least values of the function f on each of the 
following intervals I. 


iy J =e (ii) I = |—4,7| 


To study the latter part of Subsection 3.3 you will need a video player and 
the Video ‘Tape. 


In this section, the main aim is to extend the range of functions which you 

can differentiate. In Subsection 3.1 the derivatives of sin xz, cosx, e* and These functions were all 
In x are introduced, with some informal justification. These results are discussed in Chapter A3. 
then tabulated, together with that for the derivative of x” which you saw 

in Subsection 1.3. 


Subsection 3.2 states two straightforward rules that make it possible to 
differentiate various combinations of these basic functions. Subsection 3.3 
introduces the alternative Lezbniz notation for derivatives, which features 
in the Video ‘Tape. 


3.1 Basic derivatives 


We look in turn at several basic functions, reviewing first a case that you 
have already seen. 


Power functions and constant functions 


It was stated in Subsection 1.3 that the derivative of any power function is 
given by the result 


if fir) =o" then. 7 a ae, (1.4) 


where n is any real number. You saw this formula for f’(x) justified in 
several individual cases (n = 0,1, 2,3, 4). 


In the case n = 0, the function is simply f(a) = 1, and the derivative 

f'(a) = 0 corresponds to the fact that the graph of y = 1 is a horizontal 
line, which has gradient 0 at all points. It is clear that the same applies for 
any other horizontal line, whose equation has the form y = c where c is a 
constant. Any function f(x) = c, where c is a constant, is called a 
constant function. Thus if f is any constant function, then its derivative 
is Zero: 


if f(z) = c (c COnstam ew Fe fet) 


Sine and cosine functions 


There now follows an attempt to make plausible the formulas for the While the following text 
derivatives of the trigonometric functions sin and cos, without resorting to should give you some idea of 
a formal justification. how the derivative formulas 
for sin and cos arise, it is not 
The approach will be the opposite of that employed in Subsection 2.2. onc dain i tatedix 


There information about the values or sign of f’(x), obtained from its rule, to apply these results. If you 
was used to deduce the behaviour of the graph of f(a). Here, the graph of are short of time, you may 
f(x) is used to sketch the graph of f’(x2), which suggests what the rule for prefer to note results (3.2) 
f'(x) might be. We start with the graph of f(x) = sinz, as shown in and (3.3) below, and then 
Figure 3.1(a), overleaf. read on from that point. 


Note that, in this graph and 
throughout the following 
discussion, x is in radians. 


Remember that your 
calculator needs to be in 
radian mode for this 
calculation. 


ae 


Figure 3.1 (a) Graph of y=sinz (b) Graph of y = sinz on (0, 27] 


The graph of sin is periodic, with period 27, so the same will be true of its 
derivative, which gives the gradient at each point of the graph. It suffices 
therefore to consider the portion of the graph in the interval |0, 27], as 
shown in Figure 3.1(b). Within this interval, sin has stationary points 
(zero gradient on the graph) at x = 51 and ¢ = Sr, so we must have 


fi(4n) = f'n) =0. 
The graph of sin has positive gradient on the intervals (0, $7) and ($7, 27), 
and negative gradient on the interval (57, $77), so 


f'(x) > 0 for x in (0,47) and for « in ($7, 27), 


f(x) <0 for x in (57, 27). 


This assists in sketching the graph of f’(x), since this graph is above the 
z-axis where f’(x) > 0 and below the z-axis where f'(x) < 0. By looking 
at the gradient in Figure 3.1(b), we see that the derived function f” is 
decreasing on the interval (0,7) and increasing on the interval (7, 27). 


Finally, we ask what are the greatest and least values that the gradient of 
sin can take within the interval [0,27]. The greatest value occurs where 
the graph is at its steepest increase, which seems to be at x = 0 and 

x = 2m. The least value occurs where the graph is at its steepest decrease, 
which seems to be at x = 7. The symmetry of the graph suggests that the 
gradient at x = 7 is the negative of the gradient at x = 0. 


The following activity indicates that the gradient at x = 0 of the graph of 
sin is close to 1. 


Activity 3.1 Calculating the gradient of sinatx=0 


According to equation (1.3), the derivative of f(a) =sinz at x = 0 is 


Har we { neh) — eee. fae 
F'(0) = fim (ROR) = i (F ). 


h—0 


Calculate the values of the ratio (sinh)/h for each of h = 0.1, 0.01, 0.001, 
giving your answers to six decimal places. To what limit does the ratio 
seem to tend as h — 0? 


A solution is given on page 56. 


The result of Activity 3.1 and the text beforehand suggest that if 
fiz) =emag; then 


PQ =1,. (Se= a) =k 


SECTION 3. DIFFERENTIATING FURTHER FUNCTIONS 


Adding this information to that obtained previously, we can sketch the 
graph of f’(x) as shown in Figure 3.2. 


Figure 3.2 Sketch graph of y = f’(x), where 
fle) =sine 


This looks like the graph of the cosine function! In fact, the derivative of 
sin is indeed cos; that is, 


if f(z) = sinew, thee 7 (2) = coex, (3.2) 


We could undertake a similar argument to make plausible the formula for 
the derivative of f(x) = cos, but there is a quicker approach, based on 
result (3.2). 


The graph of cos (derivative of sin) is shown in Figure 3.2. This graph is 
the same shape as that of sin, but translated by 51 to the left. Hence the 
graph of the derivative (giving the gradient) of f(x) = cosx has the same 
shape once again, but translated a further ST to the left. However, the 
outcome of translating the graph of cos x in this way is the graph 

of —sinz. This explains the result that 


if f(z) = cosa, then f’(x) = —sinz. (3.3) 
Exponential and logarithmic functions 


First recall the shape of graphs of the form y = a® (a constant), some 
instances of which are shown in Figure 3.3. 
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Figure 3.3 Graphs of y =.a* for a = 5, | ee 


Note that result (3.2) holds 
only when 2 is in radians. 
The same applies to 

result (3.3) below. 


Translating the graph of cos x 
to the left by $1 gives the 
graph of cos(a + $7), as 
explained in Chapter A3. 
Using trigonometric identities 
from Chapter A2, we have 
cos(x + $7) = cos(5m — (—2)) 
= sin(—x) = —sinz. 


As with the trigonometric 
functions earlier, it is not 
essential to follow the 
explanatory details here of 
how the derivative formulas 
for exp and In arise. If you 
are short of time, you may 
prefer to note results (3.5) 
and (3.6) below, and then 
read on from that point. 


Exponential functions were 
introduced in Chapter A3, 
Subsection 3.2. 


ot 


The value of e stored by your 
calculator can be found by 
evaluating e! = exp 1. 


Each of the graphs in Figure 3.3 passes through the point (0,1), since 

a° = 1 whenever a # 0. It seems that the greater the value of a, the larger 
is the gradient of the graph of y = a” at (0,1). For y = 2”, the gradient at 
(0,1) appears to be less than 1, whereas for y = 3” the gradient at this 
point appears to be greater than 1. This can be seen by comparison with 
the broken line through (0,1) which represents y = x + 1 and has 

gradient 1. The number e = 2.718... has the property that the gradient of 
the graph of y = e” at the point (0,1) is exactly 1, as illustrated in 

Figure 3.4. 


Figure 3.4 Graph of y = e® 


This defining feature of the number e was stated in Chapter A3. In terms 
of derivatives, it means that if f(z) = e”, then f’(0) =1. According to 
equation (1.3), this is the same as saying that 


O89 ee : 
lim (==) —1; thatis, lim (: a (3.4) 
h—0 h 


h—0O h 


The next activity asks you to check that this is the case. 


Calculate the values of the ratio (e” — 1)/h for each of 
h = 0.1,0.001, 0.000 01, giving your answers to six decimal places. To what 
limit does the ratio seem to tend as h — 0? 


A solution is given on page 56. 


It is now straightforward to find the derivative of f(x) = e* from 
definition (1.3). Thus we have 


f(ic+h)—f@ ee 
h = h 


= ———— (by arule for powers) 


| 

ay) 

8 
Me Ae 
ay) 
a 
| 
feat 
a 


On taking the limit as h — 0, we obtain 


fi(a)-= lim aera 


h—0 h 


— e” —] 
So: h 


=e” (by equation (3.4)). 


In other words, 
efile, thom yoy. a) 


This result states that the derivative of the exponential function is the 
exponential function! 


The following illuminating geometrical interpretation of this result leads on 
to the formula for the derivative of the function Inz. Suppose that P(x, y) 
is any point on the graph of y = e”, and that ¢ is the tangent to the curve 
at P; see Figure 3.5(a). By result (3.5), the line @ has gradient e*, which is 
also the ratio ‘rise over run’ for the triangle PQR that has its base QR on 
the x-axis. Since the ‘rise’ RP is y = e”, the ‘run’ QR must be 1. In other 
words, any tangent at (x,y) to the graph of y = e® intersects the x-axis 
one unit to the left, at (~ — 1,0). 


Figure 3.5 (a) Tangent ¢ to graph of y=e*” (b) Corresponding tangent ¢’ to 
graph of y = Inz 


This geometrical result helps us to deduce the derivative of the natural The function In was 
logarithm function In which, as you may recall, is the inverse of the introduced in Chapter A3, 
exponential function e*. The graph of an inverse function is obtained from Subsection 4.3. Section 4 of 


that of the original function by reflection in the 45° line, with that chapter also dealt more 
equation y = =. widely with inverse functions. 


Suppose then that the graph and lines in Figure 3.5(a) are reflected in the 
45° line, accompanied by a swapping of the labels x and y throughout, so 
we again have a horizontal x-axis and vertical y-axis. The outcome is 
shown in Figure 3.5(b), where P’, Q’, R’ are respectively the images of the 
points P, Q, R, and @’ is the image of the line @. The reflected graph has 
equation x = e” or, equivalently, y = Inz. 
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Now the line @’ is tangent to this graph at P’, and the gradient of @’, with 
reference to the triangle P’Q’R’, equals the ‘rise’ Q’R’ = 1 divided by the 
‘run’ R’P’ = x, that is, 1/x. We have therefore arrived at the following 
result: 


if fie) = ine eS Oy ae f(z) = - (eae. (3.6) 


The differentiation results encountered in this subsection are collected 
Although you may refer to together in the table below. 
the Handbook for these 


derivatives, they crop up so Table 3.1 
regularly in calculus that it is Function f(z) Derivative f’(x) 
probably worth trying to 
remember them. Cc 0 
a aa 
sin © COS & 
COs & —sin x 
a e 
lis. jf >e i/z 42 >0) 


3.2 Sum and Constant Multiple Rules 


The derivatives of some basic functions are given in Table 3.1. Sums and 
constant multiples of these basic functions can be differentiated using the 
These rules are following rules. 
straightforward consequences 
of definition (1.3) of the 
derivative, though the proofs Sum Rule 
are not given here. If k is a function with rule of the form k(x) = f(x) + g(x), where f 
and g are smooth functions, then k is smooth and 


k(x) = f'(#) + 9'(2). 


For example, if f(x) = x? and g(x) = sinz, then (from Table 3.1) 
f'(e) = 22 2 ame. 9 ez, 
so the function k(x) = x? + sin x has derivative 


k'(x) = 2x + cosa. 


Constant Multiple Rule 


If k is a function with rule of the form k(x) = cf(x), where f is a 


smooth function and c is a constant, then k is smooth and 


ki(x) = cf'(2). 


For example, if f(x) = x? and c= 5, then the function k(x) = 52” has 
derivative 


ieh=S «2a Te, 


These two rules are often used in combination. For example, if 
f(x) = 7x? — Asin x, then we can write immediately (using these rules) that 


f'(x) = 7(2x) + (—4)(cosx) = 14% — 4 cosa. 
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In fact, these two rules were in effect taken for granted earlier: 


© in Frame 4 of the audio tape, in giving the gradient formula for a See Subsection 1.2. 
general quadratic function, repeated in result (1.1); 


© in giving the formula for the derivative of any polynomial function, in See Subsection 2.1. 
the form of result (2.2) or (2.3). 


In future they will be used without comment. 


_Activity 3.3. Using the Sum and Constant Multiple Rules 


Differentiate each of the following functions. 
(a) f(x) =a*-2sinx ~ (b) g(x) = 3e7 —4 nz 


6 
(e)-pla)= 72 + 3cosz 


Solutions are given on page 57. 


3.3 Other notations for derivatives 


In Subsection 1.3 you saw the definition of differentiation of a smooth 
function f as the process of finding the derived function f’ of f. There are, 
however, other notations in common use for describing derivatives. Which 
notation to use depends partly on the context, as you will see. 


The function notation f’ is the preferred notation of those who study 

calculus principally as a branch of mathematics. It arises from the view 

that the fundamental objects within calculus are functions rather than 

variables. When calculus was developed, however, in the seventeenth 

century, today’s notion of ‘function’ did not exist. Newton and Leibniz The name of Leibniz is 
were concerned with geometric problems (for example, finding tangents to | pronounced ‘lie-b-nitz’. 
curves), so it was natural for them to work with the variables involved in 

the equations of curves. 


The notation invented by Leibniz can be used for any pair of related 
variables, and it makes explicit mention of these variables. For example, if 
the variables y and x are related by the equation 

y= a", 


then Leibniz would have spoken of ‘the derivative of y with respect to x’ 
and written this as 


ed — et od The derivative on the 
left-hand side may also be 
Here dy/dz has the same meaning as f'(x), where f is the function written as dy/dz. It is 
f(x) =x’. Note that the expression dy/dzx should be regarded as a pronounced as 
complete symbol since, despite its appearance, it is not a fraction in the ‘dee-y by dee-x’, or sometimes 


ordinary sense. In particular, the two ds which appear in it do not cancel. just as ‘dee-y dee-x’. 
Nor do the subexpressions dy and dx have any individual meaning. 


When the derivative for a particular value of the independent variable is 
required, the Leibniz notation has to be adapted. For example, if 
f(x) = x? then we have f’(3) =6. In Leibniz notation, with y = x”, this 
value of the derivative at x = 3 would be expressed as 

dy 


= 6, 
dx ee 


The Greek letter 6 (delta) is 
placed in front of a variable 
to indicate a small change in 
that variable. Symbols such 
as 0x and dy are ‘complete 
symbols’, not products. 


This matches the definition of 
derivative in equation (1.3), 
with h replaced by dx” and 
f(x +h) — f(x) replaced 

by oy. 


This is the situation which 
was considered in 
Example 1.2. 


See Subsection 2.2, in the 
text immediately after 
Activity 2.6. 


Gs= 
The symbol qe is 


pronounced as ‘dee-two-s by 
dee-t-squared’. 


Leibniz arrived at his notation by thinking of the derivative as a means of 

describing the ‘rate of change’ of a dependent variable as the independent 

variable changes. This may be visualised by starting from a point (a, y) on 
the graph of y = f(x), and then making a small change doz in x which leads 
to a small change dy in y; see Figure 3.6. 


a+ én 


Figure 3.6 Small change ox in x and 
corresponding small change dy in y 


The ratio dy/dzx is the ‘average’ rate of change of y with respect to x over 
the interval [2,2 + dz]. Leibniz used the notation dy/dzx to indicate the 
limiting value to which the quotient dy/dz approaches as 02 becomes 
small; that is, 

dy 4, Y 

as ae 
It is then natural to regard the derivative dy/dz as the ‘instantaneous’ rate 
of change of y with respect to x. While dy/dz is not itself a quotient, it is 
the limit of quotients of the form dy/dx as dx — 0. 


As mentioned earlier, the Leibniz notation can be used whenever two 
variables are related. For example, the equation 


A=nr’ 


relates the area A of a circle to its radius r. The derivative of A with 
respect to r is 


This represents the instantaneous rate of change of A with respect to r. 


As another example, if a car moving along a straight road has position 
s = 30t — t? metres from its starting point after t seconds, then its velocity 
v (in ms_‘) at time t is given by 
ds 

ao 30! Ze: 
For this example, a further differentiation with respect to t gives the 
acceleration a of the car, which we originally denoted by a = f(t) (where 
s = f(t)). In Leibniz notation, this is written as 


pee 
«dt? 


In general, if y = f(x), the function notation and Leibniz notation are 
matched by the statements 


= f(x) and SH= fe), 


If we have (say) s = 30t — t?, a variation of the Leibniz notation for the 
derivative is to write 


d d 9 

— ts es d 

dt (s) oF dt (30¢ t ). The notation m ) is 
The latter form enables us to write derivative results very concisely. For pepe as “dee-by-dee-t 
example, two of the derivatives from Table 3.1 can be expressed as oft }. 

d : d 

—(2#")=nz” ~~ and —(cosz) = —sinz. 

7 = ) 7 et ) 


The Leibniz notation is often used in applied mathematics, science and 
engineering, where there tends to be greater emphasis on variables than on 
functions. The variables frequently represent quantities with a physical 
interpretation, which are related to one another within a mathematical 
model. 


One other notation is worth mention here. In textbooks on motion, 
especially in physics and engineering, differentiation with respect to time t 
is often represented by a dot above the dependent variable, as follows: 


— 
s=—. 
dt 
This notation was devised by Newton, who used calculus to study motion. 
The corresponding second derivative is denoted by 
= —S 
s= =, The expression § is 
pronounced as ‘s double dot’. 


The expression § is 
pronounced as ‘s dot’. 


Find each of the following derivatives, using the notation indicated. 
dV d*V 


a.) == and 7 where V = orr* 


( 
(b) f’(5) and f’”(3), where f(x) = 45lnz 
ds 
) a 

t=2 
(d) & and tw, where u = 3cost 


, where s = 10 — 5¢? 


Solutions are given on page 57. 


This section concludes with a video band that provides a further 
explanation of most of the results in Table 3.1, page 40. This video band, 
which uses the notation exp z rather than e* for the exponential function, 
includes animation of tangents to curves and use of a hypothetical device 
called a ‘gradient meter’. 


Now watch Video Band C(i), ‘Visualising differentiation’. 


Summary of Section 3 


This section has introduced: 


© the formulas for the derivatives of some basic functions, as listed in 
Table 3.1 on page 40; 


© the Sum and Constant Multiple Rules for differentiating combinations 
of smooth functions; 


© the Leibniz notation and Newton’s notation for derivatives. 


Exercises for Section 3 


Exercise 3.1 
Differentiate each of the following functions. 
(a) u(a).=4sine — Seosz (b)..ahe) = fips, = 2e* 
fou 
= ga = 2 1--+-— 
(c) wha) =e z+ : i = 
Exercise 3.2 


Find each of the following derivatives, using the notation indicated. 


e cP 
(a) = and ae where P = cosu — sin u 
(b) v and u, where v = —e’ + 17¢+3 
d 
(c) ee | where w = 5s4 — 10s? 
ds s=3 


In Subsection 3.1 you saw the derivatives of various basic functions, and in These derivatives are given in 
Subsection 3.2 you met the Sum Rule and Constant Multiple Rule for Table 3.1 on page 40. 
differentiating sums or constant multiples of smooth functions. By using 

Table 3.1 and these two results you can differentiate many functions, 

including all polynomial functions. 


In this section you will see how to differentiate various types of function 
whose rules have a more complex structure. The three subsections deal in 
turn with functions that can be viewed as products, quotients or 
composites of a pair of more basic functions. At the end of Subsection 4.3, 
a table of derivatives is given which is a more general version of Table 3.1. 


4.1 Product Rule 


Here is an example of a function that cannot be differentiated using the 
Sum and Constant Multiple Rules: 


k(x) = 2 sina, (4.1) 


This function, k(x) = x? x sin z, is the product of the two basic functions 
f(x) = x? and g(x) = sinz, each of which you already know how to 
differentiate. In order to differentiate k, therefore, we need a ‘product rule’ 
for differentiation. Such a rule is available, and is stated below. 


Product Rule 


If k is a function with rule of the form k(x) = f(x)g(x), where f and This rule and the others 
g are smooth functions, then k is smooth and quoted in this section are 
consequences of 
k(x) = f'(x)g(x) + f(x)g' (2). (4.2) definition (1.3) of the 


derivative, though the proofs 


, are not given here. 
The right-hand side of equation (4.2) is the sum of two terms, each of 


which is the product of one of the constituent functions, f or g, with the 
derivative of the other. 


For example, if k(x) = x? sin x, as in equation (4.1), then k(x) = f(x)g(z), 
where f(x) = x? and g(x) = sinz. Now f’(x) = 22 and g’(x) = cosz so, by 
the Product Rule, we have 


k(x) = f'(x)g(x) + f(x)9' (x) 
= (2x)(sin x) + (x*)(cos x) 
= 27 sinz + x cosa. 


Here are some similar examples for you to try. 


By first identifying each of the following functions as a product of the form 
f(x)g(x), and then using the Product Rule, differentiate each function. 


(a) k(2)= aoe (b) k(x) = (227 — 1) cosa (c) k(x) = /z(a +sinz) 


Solutions are given on page 57. 


Both this and the earlier form 
of the Product Rule can be 
written in different ways. For 
example, the right-hand side 
here can also be written as 


There is no danger of dividing 
by 0 here since any points x 
for which g(x) = 0 are not in 
the domain of k. 


With practice, it is possible to apply the Product Rule informally, without 
assigning names to the two constituent functions, by remembering in 
words that the derivative of ‘product = (first) x(second)’ is given by 


derivative derivative 


derivative 
fe ae ae ( of first ) fr leeraae SE) x a cau 


You should aim to be able to differentiate a function which is a product in 
this informal manner because it is often inconvenient to give the names f 
and g to the constituent functions, as the following activity demonstrates. 


Differentiate each of the following functions. 
(a) f(x) = (3a? + 2x + 1)e? (b) g(x) =sinxcosx 
(c) fl) =@ +3) 


Solutions are given on page 57. 


In Subsection 3.3 we described the alternative Leibniz notation for 
derivatives. In terms of this notation, the Product Rule may be expressed 
as follows. 


Product Rule (Leibniz form) 
If y = uv, where u = f(x) and v = g(z), then 


dy du dv 


: a 


To compare this equation with equation (4.2), it is necessary to let 
y =e). 


4.2 Quotient Rule 


Now consider again the basic functions f(x) = x? and g(x) =sinz. The 
function 


aha 


cannot be differentiated using the Sum, Constant Multiple and Product 
Rules. In this case k(x) is the quotient of f(a) and g(x); the function k 
can be differentiated using the following rule. 


f(@) _ 


g(x) sing a 


Quotient Rule 


If k is a function with rule of the form k(x) = f(x)/g(x), where f and 
g are smooth functions, then k is smooth and 


fo ee (4.4) 


pacman) 


The numerator of the right-hand side of equation (4.4) is the difference of 
two terms, each of which is the product of one of the constituent functions, 
f or g, with the derivative of the other. 


For example, if k(x) = x?/sin x, as in equation (4.3), then 
k(x) =-f(@)/ g(x); where f(x) =-2*-and. g(2)-= sin z.. Now f'(x) = 2x and 
g'(x) = cosx so, by the Quotient Rule, we have 


g(x) f(z) — f(x)g'(x) 
— (g(a) 
= (sin x)(2x7) — (*)(cos x) 
(sin a)? 
2 ain = 2 ease 


sin? x 


Here are some similar examples for you to try. 


By first identifying each of the following functions as a quotient of the form 
f(x)/g(x), and then using the Quotient Rule, differentiate each function. 


(a) Kx)=—  (b) k@) = 


Solutions are given on page 57. 


os x x+1 


-  () K(x) = 


x—-1 


As with the Product Rule, you should aim to apply the Quotient Rule 
without assigning names to the constituent functions, by remembering in 
words that the derivative of ‘quotient = (top)/(bottom)’ is given by 


ee derivative ae ee deca 
of top P of bottom 


derivative \ _ 
of quotient / (bottom)? 


uotient Rule informally 


Differentiate each of the following functions. 
sin £ Inu t 
perros b = 
@) f=) (w= 4 
Solutions are given on page 58. 


Comment 


Since tanx = sinx/cosz, the result of part (a) shows that the derivative of 
tia) atone we f' (ee ecr x 


In Leibniz notation, the Quotient Rule may be expressed as follows. 


Quotient Rule (Leibniz form) 


li y = u/v, where u= f(#) and:v =.9(2 0, then As with the Product Rule, 
/ Ha) ga) # the right-hand side here can 
be written in other ways, for 


example, as 


ifau dv 
ee ee AG 


H=S (5 =) 


dx w\ dx dx 


The composite function 

k(x) = g(f(x)) is defined only 
if any possible output from f 
is an allowable input for g. In 
the language of Chapter A3, 
Section 1, the image set of f 
must be contained within the 
domain of g. 


Note that cos? 7 means 
(cos x). 


4.3 Composite Rule 


In the previous two subsections, we used the basic functions f(#) = «* and 
g(x) = sina to construct the product f(x)g(a) = x*sinx and the quotient 
f(x)/g(x) = x?/sinx. There is yet another way in which these two basic 
functions can be combined to form a different type of function. 


If we first apply the function f to x, to obtain f(x), and then apply the 
function g to f(x), we obtain the function 


k(x) = o(f()) = sin(2’). (4.5) 
A function k with rule of the form k(x) = g(f(x)) is called a composite 


function, or informally a ‘function of a function’. The expression g(f(z)) 
can be thought of as 


g(u), where u = f(a). 


The intermediate variable u labels the outputs from the ‘inner’ function f, 
which are also the inputs to the ‘outer’ function g. The rule for g in the 
case of equation (4.5) can be expressed as g(u) = sin u. 


Some other examples of composite functions are: 
k(x) =e**, where u= f(x) = 2x and g(u) =e"; 
2 


k(x) = cos*x, where u= f(x) =cosz and g(u) =u’. 


Activity 4.5 Recognising composite functions — - 


Identify each of the following as a composite function of the form 
k(x) = g(f(x)), by specifying the rules for the ‘inner’ function u = f(z) 
and for the ‘outer’ function g(w). 


(a) k(z) = Ine +) (b) k(x) = cos(z*) ey Sia) = coe" se 
(a) k(x) =e 


Solutions are given on page 58. 


To differentiate composite functions, we use the following rule. 


Composite Rule 
If k is a function with rule of the form k(x) = g(f(x)), where f and g 


are smooth functions, then k is smooth and 


k(x) = g'(f(@)) fF (2). (4.6) 


This rule needs to be applied carefully. The right-hand side of 

equation (4.6) is a product of two derivatives, but the first of these involves 
the function g’ evaluated at f(x). When first learning how to use the 
Composite Rule, it is helpful to replace f(x) by the variable wu, as in the 
text about composite functions above. 


For example, if k(x) = sin(x?), as in equation (4.5), then k(x) = g(f(2)), 
where u = f(x) = x? and g(u) =sinu. 


SECTION 4. PRODUCTS, QUOTIENTS AND COMPOSITES 


Now f’(a) = 2a and g'(u) = cosu so, by the Composite Rule, we have 
k(x) = g'(f(a)) fF (2) 
=g'(u)f'(x) (where u= f(x) = 2°) 
(cos u) (22) 


= cosir”) x 2. (sinceu = x7) 


= 2p conla*). 


Here are some similar examples for you to try. 


Activity 4.6 Using the Composite Rule 


By first identifying each of the following functions as a composite of the 
form k(x) = g(f(x)), and then using the Composite Rule, differentiate 
each function. 


(a) k(x) =e" ~— (b) K(x) =sin(2x) ~—(c)_-k(x) = cos(e”) 


Solutions are given on page 58. 


Once again, you should aim to apply the Composite Rule without 
assigning names to the constituent functions, by remembering in words 
that the derivative of ‘composite = outer(inner)’ is given by 


( derivative - oo) a) If k(x) = g(f(x)), then f is 


of composite of outer of inner the inner function and g the 
outer function. 


However, you must also remember that the derivative of the outer function 
g is evaluated at f(x), which means that the ‘words’ version above is not 
self-contained in the manner of the corresponding statements for the 


Product and Quotient Rules. Although you should aim eventually to be You should explicitly assign 
able to differentiate a function which is a composite in this informal names to the constituent 
manner, mastery may take longer than with the earlier rules. functions for as long as you 


find this helpful. 
: Activity 4. 7 Using the Composite Rule informally =—— 


Differentiate each of the following functions. 
(a) f(x) = cos(4a) (b) g(x) = In(v? +241) (c) u(t) = sin(cos ft) 


Solutions are given on page 58. 


The Leibniz form of the Composite Rule is particularly memorable. When 
stated in this form, the result is usually referred to as the Chain Rule. 


Chain Rule (Leibniz form of Composite Rule) 
If y = g(u), where u = f(x), then 


dy dy du 


dx dudx 


This version has the merit (for memorising its form) but also the danger of 
looking like an obvious statement about fractions (which it is not!). 
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For example, if y = sin(x?), then y = sinu, where u = x”, so 


ee ==) 608 &, = =27 and 
ed = ae = (cos u)(2x) = 2x cos(x’). 
Finally, note that the Composite Rule enables us to generalise somewhat 

You have already the results of Table 3.1, to cope with functions such as sin(ax), cos(ax), 
differentiated specific e* and In(ax), where a is any non-zero constant. For suppose that the 
functions of this type: derivative of the function g is known, and that the function k is defined by 
sin(22z) in Activity 4.6(b), and &(z) = g(ax). This has the form of a composite function k(x) = g(f(2)), 
cos(4a’) in Activity 4.7(a). for which the inner function is f(x) = ax, with derivative f’(x) =a. It 


follows from the Composite Rule that 


k(x) = 9 (f(x) ) fF (&) = ag’ (aa). 
By applying this result to the functions in the last four lines of Table 3.1, 


This table, in which a is a we obtain the amended table below. 
non-zero constant, appears in 


the Handbook. Note that the Table 4.1 
results of Activities 4.6(b) Function f(z) Derivative f’(z) 
and 4.7(a) are consistent with 
entries in this table. C 0 
ri Ce ee 
sin(ax) a cos(a2x) 
cos(az) —asin(ax) 
2 er 
In(az) (ax > 0) le jag >) 


Summary of Section 4 


This section has introduced: 


© the Product Rule, Quotient Rule and Composite Rule (Chain Rule) 
for differentiating functions which are appropriate combinations of 
other more basic functions; 


© the formulas for the derivatives of some basic functions of the 
form f(x) = g(ax), where a is a constant, as listed in Table 4.1 above. 


Exercises for Section 4 


Exercise 4.1 


Use the Product Rule to differentiate each of the following functions. 
(a) flie-=— (e+ epee (b} gla) =e? cose (c) u(t) = VtInt 


Exercise 4.2 


Use the Quotient Rule to differentiate each of the following functions. 


xt mt —71 1 


a) 9) = _— a 


In x et COS £ 


Exercise 4.3 


Use the Composite Rule to differentiate each of the following functions. 
(a) k(x) = sin(Inz) (b) k(x) = sin* x (e) May =e? 


(d) p(t) = In(e’ +t) (e) g(s) = cos @ (fil) = / 


cos? y 


To study this section you will need access to your computer, together with 
Computer Book C. 


In this section you will see that the computer can be used to find the 
derivative of any smooth function whose rule is entered. This means that, 
even where the function concerned is too complicated to permit the general 
differentiation results in this chapter to be applied with ease ‘by hand’, an 
answer can still be obtained from the computer. Equally, the computer can 
be used to provide an independent check on any calculation done by hand. 


Note, however, that this does not absolve you from the need to be able to 
differentiate by hand! Just as with calculators, there are certain 
calculations in differentiation which are sufficiently simple (with practice) 
to make it worthwhile to do them by hand rather than by switching on the 
computer. 


In more complicated cases the computer may provide the only viable 
option, and can then be extremely useful. As you will see, it can be used 
also to find precisely and to classify stationary points and to apply the 
Optimisation Procedure described in Subsection 2.3. Since the graph of 
any function can be plotted rapidly on the computer, it provides an 
efficient alternative to the graph-sketching techniques introduced in 
Subsection 2.2. 


Refer to Computer Book C for the work in this section. 


Summary of Section 5 


In this section you saw how the computer can be used to find derivatives 
symbolically, and you applied this facility in the context of examples which 
called for optimisation. 


You are expected to be able 
to differentiate ‘by hand’ any 
function appearing in 

Table 4.1 on page 50, together 
with any constant multiples, 
sums, products, quotients or 
composites of these functions. 


You saw a rough form of 
optimisation, based on a 
plotted graph, carried out in 
Chapter A3, Section 5. 


Summary of Chapter C1 ee 


In this chapter you met differentiation, which is the process that allows 
calculation of the rate at which one variable changes with respect to 
another variable. This can be visualised as the gradient of a corresponding 
graph. The process can be used to find the optimum value of a function 
over an interval, or to assist in sketching the graph of a function. 


Table 4.1 on page 50 lists the derivatives of certain basic functions. Other 
functions may be built up from these by taking constant multiples, sums, 
products, quotients or composites of them. The chapter introduced rules of 
differentiation for all of these cases. 


Learning outcomes 


You have been working towards the following learning outcomes. 


Terms to know and use 


Calculus, gradient, tangent, smooth, rate of change (average, 
instantaneous), derived function, derivative, differentiation, power 
function, quartic function, local maximum, local minimum, stationary 
point, increasing/decreasing on an interval, second derived function, 
second derivative, optimum value, optimisation, constant function, 
composite function. 


Symbols and notation to know and use 
The various notations for first and second derivatives: 
© f'(x), f(x) (function notation); 


dy d*y d d Hid | 
ae? dad? ay WY) ay ot (=) (Leibniz notation); 


© 8,8 (Newton’s notation). 


© 


The notation for limits, as applied in the definition of the derivative: 


eget 
© (2) = jim (SE 

dy... . Of ie 
© FP Jim. = (Leibniz notation). 


) (function notation); 


Mathematical skills 


© Differentiate ‘by hand’ any of the basic functions whose derivatives are 
given in Table 4.1 on page 50. 


© Use the appropriate rules to differentiate any constant multiple of a 
basic function, or any sum, product, quotient or composite of a pair of 
basic functions. 


© Use differentiation to find instantaneous rates of change, including 
velocity (given a position function for an object), and evaluate the rate 
of change at any value of the independent variable. 


© Use differentiation to find the gradient of a graph, and evaluate the 
gradient at any point on the graph. 


SUMMARY OF CHAPTER C1 


© Apply the limit definition of the derived function/derivative in very 
simple cases. 

© Differentiate twice to find second derivatives. 

© Apply tests to determine whether a stationary point is a local 
maximum or a local minimum (or neither). 

© Draw a rough sketch of the graph of a simple function, based on the 
location and classification of stationary points and the position of the 
y-intercept. 

© Find the maximum or minimum value of a function on an interval. 

Mathcad skills 

© Apply Mathcad to differentiate an expression with respect to a 
specified variable, by using Mathcad’s d/dx operator and symbolic 
evaluation. 

© Use the copy and paste facilities in Mathcad. 


Ideas to be aware of 


© 
© 


The difference between average and instantaneous rates of change. 


That the graph of a smooth function has a gradient at each point on 
it, as given by the gradient of the tangent at the point. 


That the derived function (derivative) of any smooth function is found 
by differentiation, and is equivalent both to the gradient of the graph 
of the function and to the instantaneous rate of change of the 
dependent variable with respect to the independent variable. 


Some possible uses for differentiation. 


The concept of an optimum value, and its relationship with the 
derivative of a function. 


Solution 1.1 

In each case we apply result (1.1), which states that 

if f(x) = ax* + br +c, then f’(x) = 2ax + b. 

(a) For f(z) = 27 +2+7, we have a= 1, b=1 and 
e= 1,80 7 ay = 22 FE. 

(b) For f(x) = 5x2? — x, we have a = 5, b = —1 and 
c= 0, so f"(a) = 10e—1. 

(c) For f(x) =5+ 32 — 42°, we have a = —3 


ae 2? 
and-¢= 5,60 f'(z) = -e 7g 


b=3 


Solution 1.2 


(a) Result (1. - gives f’(x) = 2x, so the gradient at 
the point e +), where x = s, is 
f= 2g 

(b) Result (1.1) a f'(x) = —2z, so the gradient 
at the point (3, —8) is f’(3) = —2 x 3= —6. 

(c) Result (1.1) gives f’(z) = 62 + 32, so the 
gradient at the point (1, —207) is 
Co ee 


Solution 1.3 
(a) From Example 1.1, we have 
(2 +h)? = a? + 327h + 3rh? + h’, 
so we deduce that 
(c+h)* =(x4@+h)(x +h) 
= (¢+h)(x° + 32°h + 32h? + h®) 
= 24 + 4h + 67h? + 4h? + hi. 
(b) By equation (1.3), we need to consider the 
quotient 


fle +h) — f(a) 
h h ; 
where h is a non-zero number. Using the 
expansion for (x + h)* from part (a), we have 


f(a+h)- 
h a h 
— Ag? + 627h + 4h? + h?. 


f(x) — 40°h + 6x?h? + 4h? + h* 


For small values of h, the expression 

Ax? + 627h + 4rh? + h3 is close to 4x° because 
each of 6x7h, 42h? and h? is small. Thus in the 
limit as h — 0, we obtain f’(x) = 42°. 


(c) The — (5,75) 
= ==, for which the gradient is 
P(g) = 4(4)8 = 408) = 3 


on the graph corresponds to 


Solution 1.4 


In each case, we apply result (1.4), which states that 
Wt) = 2", then fis) =a 


(a) Here f(z) ==2"°, 
f'(a) = 62° 
(b) Here f(x) = 1/2" 
derivative is 
f(x) = 
which is often written as f’(x) = —3/2*. 
(c) Here f(a) = 2 =2'/?, son = 4. The 
derivative is 
f(x) = 11/21 = 13/2, 


so n = 6. The derivative is 
= 62°. 
<*>. s0n = —3. The 


ager a 


which is often written as f’(x) = 1/(2,/2). 


Solution 1.5 
(a) The position function is s = f(t) = 5t?. This isa 


quadratic function of t, so we apply result (1.1) 
to obtain the velocity function 
v= fil 
(b) When t = 3, we have v = f’(3) = 10 x 3 = 30, so 
the velocity is then 30 ms~! 
(c) When v = 15, we have 15 = 10¢, which has 


solution t = 1.5. Hence the stone has velocity 
15 ms~? after 1.5 seconds. 


Solution 2.1 


(a) From result (1.4), the derivative of x® is 8x’ and 
the derivative of x° is 52+. From result (2.3), the 
derivative of the polynomial function 
f(x) = 62° — 32° is 

f'(x) = 6(derivative of «°) 
= 6(82") — 3(5z7) 
= 48a" — 152°. 

(b) From results (1.4) and (2.3 
the polynomial function 
f(z) = —a? + 527 — 7x + 15 is 

f'(a) = —(32*) + 5(2x) — 7(1) 
= —32° + 10x — 7. 
The point (1,12) on the graph of y = f(z) 
corresponds to x = 1, for which the gradient of 
the graph is f’(1) = —3(1)? + 10(1) -7 = 0. 


— 3(derivative of x°) 


), the derivative of 


(This outcome indicates that the graph of the 
given function f(x) has a horizontal tangent at 
the point (1, 12).) 


Solution 2.2 
(a) From results (1.4) and (2.3), the derivative of 


the polynomial function 
f(x) =2° — 627 — 152 + 54 is 
f'(x) = (3a*) — 6(2a) — 15(1) 
= 32° — 122 — 15 
Cg ae 
(b) The equation f’(2) = 0 is equivalent to 
o”? —de = 5 = 0; that is; (e+ 1)(e— 5) =0: 


This equation has solutions x = —1 and x = 5. 
Thus the stationary points of f are at x = —1 
endo = 5, 

Solution 2.3 


(a) For « < —1, we have 
x+1<0 and x-5<-6<0, 


so f’(x) = 3(a + 1)(a — 5) > 0 (since f’(z) is the 
product of 3 and of two negative numbers). 


For —1 < x < 5, we have 
x+1>0 and «r-5<J, 


so f’(x) < 0 (since f’(x) is the product of 3, one 
other positive number and one negative 
number). 


For x > 5, we have 
Ctl > 6 >O end e545 > 0, 


so f’(x) = 3(x4 + 1)(a — 5) > O (since f’(x) is the 
product of 3 and of two other positive numbers). 


(b) By the Increasing/Decreasing Criterion, f is 
increasing on (—oo, —1) and on (5,00), but 
decreasing on (—1, 5). 


Solution 2.4 


For the stationary point at « = —2, we have to 
choose test points xy, to the left 

of —2 and xp to the right of —2 such that there are 
no stationary points between x; and —2 nor between 
—2 and xr. Bearing in mind Remark 2, page 27, and 
the fact that the only other stationary point is at 1, 
we can take (say) x, = —3 and rpg = 0. (Another 
choice is z, = —3 and rR = —1, but choosing rR = 0 
simplifies the calculation of f’(xp).) 


For the stationary point at x = 1, we have to choose 
test points xyz, to the left of 1 and rp to the right of 1 
such that there are no stationary points between 21, 
and 1 nor between 1 and rr. Bearing in mind 
Remark 2, page 27, and the fact that the only other 
stationary point is at —2, we can take (say) x, = 0 
and Te = 2. 


Solution 2.5 
Step 1: ‘The derivative of the function 
f(x) = 32+ — 2x? — 9a? +7 is 
f(a) = 12x" — 62" = 182 
=ee = o 2) 
= 67(2% — 3)(x + 1). 
Solving the equation f’(x) = 0, we find that the 
stationary points of f are at x = —1, x = 0 
and 2 3 


Step 2: ‘To classify the stationary point at x = —1, 
we choose test points x, = —2 and rR = —s, say. 
Then we have . 


f(a.) = f'(-2) = 6(—2)(—7)(-1) = —84 < 0, 
f'(ar) = f'(-3) = 6(-3)(-4)(g) = 6 > 0, 
so f has a local minimum at x = —1, by the First 
Derivative ‘Test. 


To classify the stationary point at x = 0, we choose 
test points 7, = —$ and ©R = 1, say. Then we have 


f'(au) = f'(—3) =6 > 0, 
f(r) = fF (1) = 6()(-1)Q) = -12 < 0, 


so f has a local maximum at x = 0, by the First 
Derivative Test. (Note that the value of f’ (—5) was 
found when classifying the stationary point at 

g= 1} 


To classify the stationary point at 7 = 3 we choose 
test points z, = 1 and rR = 2, say. Then we have 


a ee = Ff (1) = —12 < 0, 

f'(tr) = f'(2) = 6(2)(1)(3) = 36 > 0, 
so f has a local minimum at x = 3 by the First 
Derivative ‘Test. 
Solution 2.6 


The second derivative of the function f is the (first) 
derivative of the function f’(x) = —3x? + 10x — 7. 
From results (1.4) and (2.3), this is 


f" (x) = —3(2x) + 10(1) 
oe + 10, 


Solution 2.7 


The second derivative of f is obtained by 
differentiating f’(2) = 122° — 6x? — 182, to obtain 


Vie =e’ — 12a — 18 = 662" — 22-23), 
For the stationary point at 7 = —1, we have 
fF (-1) = 6(6(—1)* = 2-1) — 3) = 30 = 0: 


so f has a local minimum at x = —1, by the Second 
Derivative ‘Test. 
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For the stationary point at x = 0, we have 
f’" (0) = 6(—3) = -18 < 0, 


so f has a local maximum at x7 = 0, by the Second 
Derivative ‘Test. 


For the stationary point at x = 3 we have 
#'(3) = 6(6(3)" — 2(3) — 3) 
= 6(44 —3- 3) =45>0, 


so f has a local minimum at x = 3 by the Second 
Derivative Test. 


(These classification results agree with those that you 
found using the First Derivative Test in Activity 2.5.) 


Solution 2.8 


The function is f(x) = 32+ — 27° — 9x7 + 7, and the 
y-coordinates of stationary points on the graph of 
y = f(a) are 

f(-1)=3 $2794 28) (OS Poe 


ay 2. 233 at 81 hee (ae 13 
es ee deer Waa aes oat Cram 


Hence the graph passes through the points (—1,3), a 
local minimum, (0,7), a local maximum (on the 
y-axis), and (2, —473), a local minimum. A sketch of 
the graph is as follows. 


(0,7) 


(—1, 3) 


(3,-478) 


Figure S.1 


Solution 2.9 


In each case, we follow the steps of the Optimisation 
Procedure. 


(a) Step 1: The derivative of the function 
f(x) = $23 — 2? — 8x41 is 
f'(z) = a2? — 22 —8 = (x — 4)(x + 2), 
so the stationary points are at x = 4 and 
a 


Step 2: At the endpoints of the interval 
I = [—3, 5], the function values are 


f(-3)=7 and f(5)=—% =—222, 


while at the stationary points (which are both 
inside J) the function values are 


Step 3: The greatest value of f(x) on |—3, 5] is 
10% (at 2 = —2) and the least value of f(x) on 
[—3, 5] is —252 (at x = 4). 


(In this case, both of the optimum values occur 
at stationary points within the interval.) 


(b) Step 1: The derivative of the function 
f(x) = 32? — 2x7 +5 is 


f' (xz) = 6a — 2 = 2(3a - 1), 


so the only stationary point is at 7 = 5. 


Step 2: At the endpoints of the interval 
I = |—1,4], the function values are 


f(—1)=10 and f(4) = 45, 


while for the stationary point (which is inside [) 
the function value is 

f() = 48 
Step 3: The greatest value of f(a) on |—1, 4] is 
45 (at x = 4) and the least value of f(a) on 
[—1,4] is 42 (at x = 4). 


Solution 3.1 


Evaluating the ratio (sinh)/h for each of 

h = 0.1, 0.01, 0.001 in turn gives the respective 
answers (to 6 d.p.) 0.998 334, 0.999 983 and 

1.000 000. The limiting value as h — 0 appears to 
be 1. 


Solution 3.2 


Evaluating the ratio (e” — 1)/h for each of 

h = 0.1, 0.001, 0.000 01 in turn gives the respective 
answers (to 6 d.p.) 1.051709, 1.000500 and 
1.000005. The limiting value as h — 0 appears to 
be 1. 


_ SOLUTIONS TO ACTIVITIES 


Solution 3.3 (c) In this case k(x) = f(x)g(x), where 
f(z) = /x = «'/? and g(x) = x+sinz. Since 
f'(@) = ta-¥/? = 1/(2/a) and 


(a) The derivative of f(x) = x* — 2sinz is 


fie = 4a" — eee g(x) = 1+ cosz, we have 
(b) The derivative of g(x) = 3e” — 41nz is k'(x) = f’(x)g(x) + f(x)g' (2) 
4 1 
g@)=3= _ = (=) (2 + sin) + (/x)(1 + cos z) 
(c) Since 1/x* = x~? has derivative _r+sne 4 eae 
—2x7~% = —2/x?, the derivative of 2/X vel ) 
p(x) = 6/x? + 3cosz is This may also be written as 
Lb 
p'(«) = —-> — 3sine. ia x+sina + 2,/x,/x(1+ cos x) 
e aye 
Solution 3.4 a eee OOS 
a 2./2 
(a) The first and second derivatives of V = sm? e 
with respect to r are Solution 4.2 
2 
ad a sa — (a) The derivative of f(x) = (3x7 + 2x + 1)e” is 
. 4 
/ = x 2 x 
(b) The first and second derivatives of f(x) = 45Inx Pee e+ Ge + 2n + 1)(e") 
45 45 b) The derivative of g(2) = sinx cos is 
/ = ite 
f'(@)=— and f"(2)=-S, ) @) 


g (x) = (cos x)(cos x) + (sinx)(— sin x) 


— cos’ x — sin’ z. 


c) The derivative of f(t) = (t? + 3)Int is 
(c) 


so we have 
Tis =o ae PS 5. 
(c) The derivative of s = 10 — 5t? with respect to t is 


1 
ss " f(t) = (2e)nt) + +3) (3) 
i —10%, so a = —20. 
; “le=2 pe — 
(d) The first and second derivatives of u = 3cost t 


with respect to t are 


s Solution 4.3 
“= -d9me antl (Absa est. 
(a) In this case k(x) = f(x)/g(x), where f(x) = e” 
Solution 4.1 ic =*. Sines f(r) = &-and 9’ fe) = 1, 
(a) In this case k(x) = f(x)g(x), where f(x) =a i : ; 
and g(x) = e*. Since Fa) = lane ¢ (x)= e*, k' (x) = g(x) f'(x) — F(z)9"(x) 
we have (g(x))? 
k(x) = f'(a)g(x) + f(a)a' (a) eee 
= (1)(e*) + ()(e*) ie 
=e (1 42), = a 
(b) In this case k(x) = f(x)g(ax), where (b) In this case k(x) = f(x gla ), where 
f(x) = 227 — 1 and g(x) = cosz. Since f(x) =cosz and g(x) = x? — 1. Since 
f'(x) = 4a and g’(x) = —sinz, we have f'(x) = —sinx and g’(x) = 22, we have 
k(n) = f'(a)al@) + Flag! (2) wa) = LL) = Hla)g'(e) 
= (47)(cos x) + (227 — 1)(—sin x) (g(x))? 
= 4¢ cos x — (2x* — 1)sinz. = _ (z* —1)(—sinz) — (cosz)(2z) 
@?— 1p 
= (x? —1)sinz + 2x cosz 
a? — 1p 


ot 


CHAPTER C1 DIFFERENTIATION AND MODELLING = 


(c) In this case k(x) = f(x)/g(x), where 
ft) = 2-1) and ge) = & = 1: ‘Since 
f(z) = 9 (2) = bre owe 


_ gx) f(x) = fla)g'(@) 


a om)? 
_ @-1)0)-(#@+)@) 
(oie 
= a 
Cae s = 
Solution 4.4 


(cos x)(cos x) — (sin x)(— sin x) 


(a) The derivative of f(x) = sinx/cosz is 
(cos x)? 


f(a) = 


cos? «+ sin’ x 


COS? & 
3 


~ cos? 2 
Since 1/ cosa = seca, this shows that 
Pie) =e" &: 


(b) The derivative of g(u) = Inu/(u? +3) is 
iu) = LEE SMA/w) = (in) (2u) 


(u2 + 3) 
i u* +3 —2u*lnu 
alu? + 3)? 


(c) The derivative of p(t) = t/(e’ — 1) is 


(e. DO) 


p(t) ae (et = 1)? 
ee ae 
-e-F 


Solution 4.5 


(a) The function k(a) = In(x? + 1) is of the form 
k(x) = g(f(x)) with 
u=f(x)=a2°+1 and gay=—mu. 
(b) The function k(x) = cos(x*) is of the form 
k(x) = g(f(x)) with 
u=f(z)=a° and g(u) =cosu. 


(c) The function k(x) = cos? z = (cos x)® is of the 


form k(x) = g(f(x)) with 


u=f(x)=cosx and g(u) =u". 


(Note that this case involves the same 
constituent functions as in part (b), but applied 
in the reverse order.) 

(d) The function k(x) = e*'"* is of the form 
k(x) = g(f(#)) with 


u= fie) =sns end git) =<. 


(because cos” x + sin? x = 1). 


Solution 4.6 


(a) In this case k(x) = e® = g(f(x)), where 
u = f(z) = 2° and g(u) = e". Now f'(x) = 3a? 
and g’(u) = e”, so we have 


k'(x) = g'(f(x)) fF (2) 


= g/(u)f"(a) (where u = f(x) = 2%) 
= (e")(30*) 
= 3g7e" (since u = 2°). 


(b) In this case k(x) = sin(2x) = g(f(x)), where 
u = f(x) = 2x and g(u) =sinu. Now f'(z) = 2 
and g’(u) = cos u, so we have 


k'(a) = g'(f(a)) f(a) 


=g'(u)f'(x) (where u = f(x) = 22) 
= (cos u)(2) 
= 2eos(22) “(since « = 22). 


(c) In this case k(x) = cos(e*) = g(f(x)), where 
u = f(x) =e" and g(u) = cosu. Now f'(z) = e” 


and g/(u) = —sin u,'so we have 
ki(x) = 9'(f(x)) f(a) 
= g/(u)f'(e) (where u= f(e) =e") 
= (—sinu)(e*) 
= —e*sin(e”) (since u =e”). 


Solution 4.7 


In each case, a concise solution (obtained by 
‘informal’ use of the Composite Rule) is followed by 
an expanded version. Note in the expanded versions 
for parts (a) and (b) that f and g cannot be used to 
denote the ‘inner’ and ‘outer’ functions, since one of 
them appears as the name of the given composite 
function. Hence we label the composite as q(p(x)) 
instead. 


(a) The derivative of f(x) = cos(4z) is 
f'(@) = (—sin(4x))(4) 
= —4sin(4z). 
Expanded version: In this case 


f(z) = cos(4x) = q(p(«)), where u = p(x) = 42 
and q(u) = cosu. Now p’(a) = 4 and 


q'(u) = —sinu, so we have 
f'(x) = q'(p(x))p'(z) 
=q'(u)p'(x) (where u = p(x) = 4z) 
= (— sin u)(4) 
= —4sin(4xr) (since u = 42). 


(b) The derivative of g(x) = In(x? + x +1) is 
(e) = (Sa) C+D 
ae hae eae 
eet 
gt tot: 
Expanded version: In this case 
g(x) = In(x? + x + 1) = q(p(x)), where 
u=p(r) =27+2+4+1 and q(u) =Inu. Now 
p(x) = 2x2+1 and q’(u) = 1/u, so we have 
g(x) = F(p(@))p' (x) 
= 7 (u)p'(2) 
(where u = p(x) = 2? +241) 


_ (=) Can 


2 1 
= a (since u = 2* +241). 
(c) The derivative of u(t) = sin(cost) is 
v(t) = (cos(cost))(— sin t) 
= — sintcos(cost). 
Expanded version: In this case 
u(t) = sin(cost) = q(p(t)), where u = p(t) = cost 
and q(u) = sinu. Now p’(t) = —sint and 
q'(u) = cosu, so we have 
u'(t) = q'(p(t))p'(t) 
= q'(u)p'(t) (where u = p(t) = cost) 
= (cos u)(— sin t) 


= —sintcos(cost) (since u = cost). 


Solutions to Exercises 


Solution 1.1 


Result (1.1) states that if f(x) = ax? + br +c, then 
f'(z) = 2ax 4+ b. For y = f(z) =6+ 5x — 42, we 
have a = —4, b= 5 and c = 6, so that 


f(a) = —8xa +5. 
Hence the gradient at the point (2,0), where x = 2, 
6 f(2) = 8x teh =-16 
Solution 1.2 


(a) Putting the left-hand side fractions over the 
common denominator (x + h)x, we have 


i Bx & cn 
tc+h «a (a+h)c (c&t+h)e 
x —(x+h) 
(x +h)ax 
- h 
— (a@ + h)a’ 


as required. 


(b) By equation (1.3), and using the result of 
part (a), we need to consider the quotient 


fleth)—fl(x) 1 ( 1 3 


h R\2+h «£ 
1 h 
=5( ae 
1 
(a +h)x 


For small values of h, the quotient —1/((x + h)x) 
is close to —1/ax?. We conclude that 
ii 
oe. 
(c) Result (1.4) states that if f(x) = 2”, then 
f' (2)= na”. Mere fis) = 1/s =2 * 50 that 


n = —1. The derivative is therefore 
f(s) =-la 2 Ss, 
which is often written as f’(2) = —1/2*, in 


agreement with the result of part (b). 


Solution 1.3 


(a) The position function is s = f(t) = 3t? +t. This 
is a quadratic function of t, so we apply 
result (1.1) to obtain the velocity function 


v= f(t) =6t+4+1. 


(b) When t = 10, we have 
v = f’(10) =6 x 10+ 1 =61, so the velocity is 
then 61 ms~?. 

(c) When v = 25, we have 25 = 6¢ + 1, which has 
solution t = 4. Hence the aeroplane has velocity 
25 ms‘ after 4 seconds. 


Solution 2.1 


(a) From result (1.4), the derivative of x'°" is 
10121, the derivative of x°° is 50x*° and the 
derivative of x (= z') is 1 (= 2°). From 
result (2.3), the derivative of the polynomial 
function f(x) = 7x19! — 132°° + z is 

f'(x) = 7(1012"°°) — 13(50x*°) + 1 
= 707x' — 650a*? + 1. 


(b) From results (1.4) and (2.3), the derivative of 
the polynomial function 
f(x) = 5x? + 4x? — 9x — 8 is 


f'(xz) = 5(3x7) + 4(2x) — 9(1) 


= 1527 + 8x —9. 
The point (—1,0) on the graph of y = f(z) 
corresponds to x = —1, for which the gradient is 


f'(—1) = 15(—1)? + 8(-1) — 9 = -2. 


Solution 2.2 


(a) The derivative of the function 
f(x) = —x? — 3x7 + 9x + 5 is 
f'(z) = —3x" — 62 + 9 = —3(2 + 3)(x — 1). 
Solving the equation f’(x) = 0, we find that the 


stationary points of f are at zy = —3 and z= 1. 


(b) (i) Step 1 of the strategy has been carried out 
in part (a). Step 2 is as follows. 


To classify the stationary point at zr = —3, we 
choose test points zy, = —4 and zp = 0, say. 
Then we have 


f'(zy) = f'(-4) = —3(-1)(—5) = -15 < 0, 
@en=FO=F30 


so f has a local minimum at x = —3, by the 
First Derivative Test. 


To classify the stationary point at x = 1, we 
choose test points zy, = 0 and rp = 2, say. Then 
we have 


f'(a.) = f' 0) =9 > 9, 
f'(er) = f'(2) = —3(5)(1) = -15 < 0, 


so f has a local maximum at x = 1, by the First 
Derivative ‘Test. 


(ii) The second derivative of f is 
f"(2) = -6x — 6 = —6(x + 1). 

For the stationary point at x = —3, we have 
f"(—3) = -—6(—2) = 12 > 0, 


so f has a local minimum at x = —3, by the 
Second Derivative ‘Test. 


For the stationary point at x = 1, we have 
f" (1) = —6(2) = -12 < 0, 


so f has a local maximum at x = 1, by the 
Second Derivative Test. 


The y-coordinates of stationary points on the 
graph of y = f(x) are 

f(—3) = 27 — 27 — 274+5 = —22, 

f() =-1-34+9+4+5=10. 
Also f(0) = 5. Hence the graph passes through 
the points (—3, —22), a local minimum, and 
(1,10), a local maximum. It also passes through 
(0,5); that is, it cuts the y-axis at y=5. A 
sketch of the graph is as follows. 


(—3, = 


Figure 8.2 


Solution 2.3 
(a) The second derivative of f is obtained by 


differentiating f’(x) = 3x7 — 122 — 15, to obtain 
f' (a) =62 = 12=@(¢ —2). 

For the stationary point at x = —1, we have 
f"(—1) = 6(-3) = -18 < 0, 


so f has a local maximum at x = —1, by the 
Second Derivative Test. 


For the stationary point at x = 5, we have 
f° (3) =@a)-= 16 > 0, 


so f has a local minimum at x = 5, by the 
Second Derivative Test. 


_ SOLUTIONS TO EXERCISES — 


(b) In each case, we follow the steps of the 
Optimisation Procedure, but Step 1 has already 
been completed. The stationary points of the 
function f(x) = 2° — 6x” — 15¢ + 54 are at 
x = —1 and x = 5. 


(i) Step 2: At the endpoints of the interval 
I = |0,7], the function values are 


j(0) =... amt Fit) = 2, 


while for the stationary point at « = 5 (which is 
inside J) the function value is 


f(5) = —46. 
(Note that, on this occasion, the other 
stationary point (at x = —1) is not inside J, and 


hence does not feature at this stage.) 


Step 3: The greatest value of f(x) on [0, 7] is 
54 (at « = 0) and the least value of f(a) on [0, 7] 
is —46 (at x = 5). 


(ii) Step 2: At the endpoints of the interval 
I = |—4, 7], the function values are 


f(—4) =-46 and f(7) =-2, 


while for the stationary points (both of which 
are inside J) the function values are 


f(-—1)=62 and f(5) = —46. 


Step 3: The greatest value of f(a) on [—4, 7] is 
62 (at 2 = —1) and the least value of f(x) on 
|—4, 7] is —46 (at both x = —4 and x = 5). 


Solution 3.1 
(a) The derivative of u(z) = 4sinx — 5cosz is 
u(x) = 4cosa2 + 5sina. 


(b) The derivative of v(x) = 7Inx — 2e” is 


7 
‘(x) = — — 2e”. 
v (x) 7 2e 
(c) Since 1/x = x~* has derivative —x~* = —1/z?, 
and 1/x? = x? has derivative —22~* = —2/z°, 


the derivative of 
w(x) = 327 — 22 +1 —-1/x24+ 2/2? is 
1 4 


hes. 
w (a) = 6n-2+ 4 — -G- 


Solution 3.2 


(a) The first and second derivatives of 
P=cosu-—sinwu with respect to u are 
dP rP 
— =-—sinu—cosu, —> 
du du? 


(b) The first and second derivatives of 
v = —e' + 17t + 3 with respect to t are 


= —cosu+sin u. 


oa —e $17, t= =e". 


(c) The derivative of w = 5s* — 10s? with respect 


to s is 
d d 
a a | a ee 
ds os 


Solution 4.1 
(a) The derivative of f(x) = (2° + x) sina is 
f'(x) = (3x7 + 1)(sinz) + (x* + x)(cos 2) 
= (327 + 1)sine + (x° + x) cos. 
(b) The derivative of g(x) 
g' (x) = (e”)(cos x) + (e”)(—sinz) 
= e*(cosx — sin 2). 


(c) The derivative of Vt = t!/? is st 1/2 = 1/(2Vt). 
Hence the derivative of u(t) = Vt Int is 


u(t) = Ga (Int) + (vi) @ 


Int+ 2 
ce ee 


=e COST 


Solution 4.2 
(a) The derivative of g(x) = x*/Inz is 
1.) _ (Ina) (40°) — (x*)(1/2) 
g (x) — (In x)? 
eae = 1) 
=. faa 
(b) The derivative of k(t) = (sint — t?)/e’ is 
ray __. (e*)(cost — 2t) — (sint — t*)(e*) 
k(t) = (et)? 
cost — sint + t? — 2t 
et 


1 
(c) The derivative of f(a) = is 


f(e) = (cos Soe as sin x) 


(Since 1/cosx = secz and sinz/cosz = tana, 
this shows that the derivative of f(x) = sec z is 
fiz) =secetans.) 


Solution 4.3 


(a) The derivative of k(x) = sin(Inz) is 


1 
k(x) = (costli ze) (=) 
es 
Expanded version: In this case 
k(x) = sin(iIn az} = g( j(@)); where 
wu = f(x) =Inz and giv) = sinu. Now 


f'(z) =1/2z and g’(u) = cosu, so we have 


k'(a) = g'(f(x)) f(a) 


_ cos(In 2) 
aaa 


= g'(u)f'(x) (where u= f(z) =Inz) 
= (cos u) | 
= ——- (since u = Inz). 


(b) The derivative of k(x) = sin* x = (sin x)* is 
k'(ax) = (4(sin x)?) (cosx) = 4 cos asin’ «. 
Expanded version: In this case 
k(x) = (sinx)* = g(f(x)), where 
u = f(x) =sinz and g(u) = u*. Now 
f'(z) = cosz and g’(u) = 4u?, so we have 
ki(x) = g'(f(x))f'(@) 
= g'(u)f'(x) (where u= f(r) =sinz) 
= (4u°)(cos x) 


=Acoszsin°x (since u=sin7z). 


(c) The derivative of k(x) = eV” is 


1 evr 
/ = Vax —_—— == 
k’ (a) (c (=z) Ws 
Expanded version: In this case 
k(x) = eV* = g(f(ax)), where 
u = f(z) = Ja = 2)? and g(u) = e*. Now 
fl(a) = 4a ¥/? = 1/ (2) and g/(u) =e", s0 


we have 


(x) (where u = f(x) = Vz) 


(d) The derivative of p(t) = In(e’ + f) is 


1 e +1 
ee ee ; 
VO = (aq) += SS 
Expanded version: In this case 
p(t) = In(e' + t) = g(f(t)), where 
u = f(t) =e’ +t and g(u) = Inu. Now 
f(t) =e’ +1 and g’(u) = 1/u, so we have 


P(t) =9 (FO) FM 
=g'(u)f'(t) (where u = f(t) =e' +1) 


= @ (et + 1) 


4] 
= — (since u = e’ + #). 


The derivative of q(s) = cos(1/s7) is 


q'(s) = (-sin (=)) (-3) = —— 


Expanded version: In this case 
q(s) = cos(1/s*) = g(f(s)), where 
w= f(s) =1/3* = s~* anergla) = cceu, Tew 
f'(s) = —2s—° = —2/s° and g’(u) = —sinu, so 
we have 

q'(s) =9'(f(s))f'(s) 
g'(u)f'(s) (where u = f(s) = 1/s*) 


ae (-3) 


_ 2sin(1/s?) 
ee 


(since u = 1/s*). 


The derivative of r(y) = 1/ cos? y is 
2 2sin y 
/ SF Glee: oe = 
fw) = (-3-) sin) = 4 
Expanded version: In this case 


r(y) = 1/cos* y = g(f(y)), where 

u = f(y) =cosy and g(u) = 1/u? = u~*. Now 
f'(y) = —siny and g’(u) = —2u~° = —2/u°, so 
we have 


ry) =g9 (fy) fy) 
=g(u)f(y) (where u = f(y) = cosy) 


2 ; 
_(-2)-aan 
2 sin y 
=. (since u = cosy). 
cos? y 


(Note that this case involves the same 
constituent functions as in part (e), but applied 
in the reverse order.) 
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